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HUMAN TRANSFERASE MOLECULES 



TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of human transferase molecules 
5 and to the use of these sequences in the diagnosis, treatment, and prevention of cdl proliferative 
disorders and immune system disorders, and in the assessment of the effects of exogenous compounds 
on the expression of nucleic acid and amino acid sequences of human transferase molecules. 

BACKGROUND OF THE INVENTION 

10 Transferases are enzymes that catalyze the transfer of molecular groups. The reaction may 

involve an oxidation, reduction, or cleavage of covalent bonds, and is often specific to a substrate or 
to particular sites on a type of substrate. Transferases participate in reactions essential to such 
functions as synthesis and degradation of cell components, regulation of cell functions including cell 
signaling, cell proliferation, inflamation, apoptosis, secretion and excretion. Transferases are 

15 involved in key steps in disease processes involving these functions. Transferases are frequently 
classified according to the type of group transferred For example, methyl transferases transfer one- 
carbon methyl groups, amino transferases transfer nitrogenous amino groups, and similarly 
denominated enzymes transfer aldehyde or ketone, acyl, glycosyl, alkyl or aryl, isoprenyl, saccharyl, 
phosphorous-containing, sulfiir-containing, or selenium-containing groups, as well as small enzymatic 

20 groups such as Coenzyme A. 

Acyl transferases include peroxisomal carnitine octanoyl transferase, which is involved in the 
fatty acid beta-oxidation pathway, and mitochondrial carnitine palmitoyl -transferases, involved in fatty 
acid metabolism and transport. Choline O-acetyl transferase catalyzes the biosynthesis of the 
neurotransmitter acetylcholine. 

25 Amino transferases play. key roles in protein synthesis and degradation, and they contribute to 

other processes as well. For example, the amino transferase 5-aminolevulinic acid synthase catalyzes 
the addition of succinyl-CoA to glycine, the first step in heme biosynthesis. Other amino transferases 
participate in pathways important for neurological function and metabolism. For example, glutamine- 
phenylpyruvate amino transferase, also known as glutamine transaminase K (GTK), catalyzes several 

30 reactions with a pyridoxal phosphate cofactor. GTK catalyzes the reversible conversion of L-glutamine 
and phenylpyruvate to 2-oxoglutaramate and L-phenylalanine. Other amino acid substrates for GTK 
include L-methionine, L-histidine, and L-tyrosine. GTK also catalyzes the conversion of kynurcnine to 
kynurenic add, a tryptophan metabolite that is an antagonist of the N-methyl-D-aspartate (NMDA) 
receptor in the brain and may exert a neurombdulatory function. Alteration of the kynurenine metabolic 

35 pathway may be associated with several neurological disorders. GTK also plays a role in the 
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metabolism of halogenated xenobiotics conjugated to glutathione, leading to nephrotoxicity in rats and 
neurotoxicity in humans. GTK is expressed in kidney, liver, and brain. Both human and rat GTKs 
contain a putative pyridoxal phosphate binding site. (ExPASy ENZYME: EC 2.6.1 .64; Perry, S.J. et 
al. (1993) Mol. Pharmacol. 43:660-665; Perry, S. et al. (1995) FEBS Lett. 360:277-280; and Alberati- 

5 Giani, D. et al. (1995) J. Neurochem. 64:1448-1455.) A second amino transferase associated with this 
pathway is kynurenine/a-aminoadipate amino transferase (AadAT). AadAT catalyzes the reversible 
conversion of a-aminoadipate and a-ketoglutarate to a-ketoadipate and L-glutamate during lysine 
metabolism AadAT also catalyzes the transamination of kynurenine to kynurenic acid. Acytosolic 
AadAT is expressed in rat kidney, liver, and brain (Nakatani, Y. et al. (1970) Biochim. Biophys. Acta 

10 198:219-228; Buchli, R. et al. (1995) J. Biol. Chem 270:29330-29335). 

Glycosyl transferases include the mammalian UDP-gJucouronosyl transferases, a family of 
membrane-bound microsomal enzymes catalyzing the transfer of glucouronic acid to lipophilic 
substrates in reactions that play important roles in detoxification and excretion of drugs, carcinogens, 
and other foreign substances. Another mammalian glycosyl transferase, mammalian UDP-galactose- 

15 ceramide galactosyl transferase, catalyzes the transfer of galactose to ceramide in the synthesis of 

galactocerebrosides in myelin membranes of the nervous system The UDP-glycosyl transferases share 
a conserved signature domain of about 50 amino acid residues (PROSITE: PDOC00359, 
httpy/«cpasy.hcuge.ch/sprot/prosite.html). 

Methyl transferases are involved in a variety of pharmacologically important processes. 

20 Nicotinamide N-methyl transferase catalyzes the N-methylation of nicotinamides and otho* pyridines, 
an important step in the cellular handling of drugs and other foreign compounds. Phenylethanolamine 
N-methyl transferase catalyzes the conversion of noradrenalin to adrenalin. 6-O-methylguanine-DNA 
methyl transferase reverses DNA methylation, an important step in carcinogenesis. Uroporphyrin-III 
C-methyl transferase, which catalyzes the transfer of two methyl groups from S-adenosyl-L-methionine 

25 ~ to uroporphyrinogen III, is the first specific enzyme in the biosynthesis of cobalamin, a dietary enzyme 
whose uptake is deficient in pernicious anemia. Protein-arginine methyl transferases catalyze the 
posttranslational methylation of arginine residues in proteins, resulting in the mono- and dimethylation 
of arginine on the guanidino group. Substrates include histones, myelin basic protein, and 
heterogeneous nuclear ribonucleoproteins involved in mRN A processing, splicing, and transport. 

30 Protein-arginine methyl transferase interacts with proteins upregulated by mitogens, with proteins 
involved in chronic lymphocytic leukemia, and with interferon, suggesting an important role for 
methylation in cytokine receptor signaling (Lin, W.-J. et al. (1996) J. Biol. Chem. 271:15034-15044; 
Abramovich, C. et al. (1997) EMBO J. 16:260-266; and Scott, H.S et al. (1998) Genomics 48:330- 
340.) 
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Phosphotransferases catalyze the transfer of high-energy phosphate groups and are important 
in energy-requiring and -releasing reactions. The metabolic enzyme creatine kinase catalyzes the 
reversible phosphate transfer between creatine/creatine phosphate and ATP/ADP. Giycocyamine 
kinase catalyzes phosphate transfer from ATP to guanidoacetate, and arginine kinase catalyzes 

5 phosphate transfer from ATP to argenine. A cysteine-containing active site is conserved in this family 
(PROSITE: PDOC00103). 

Prenyl transferases are beterodimers, consisting of an alpha and a beta subunit, that catalyze 
the transfer of an isoprenyl group. An example of a prenyl transferase is the mammalian protein 
farnesyl transferase. The alpha subunit of farnesyl transferase consists of 5 repeats of 34 amino acids 

10 each, with each repeat containing an invariant tryptophan (PROSITE: PDOC00703). 

Saccharyl transferases are glycating enzymes involved in a variety of metabolic processes. 
Oligosacchryl trans ferase-48, for example, is a receptor for advanced glycation endproducts. 
Accumulation of these endproducts is observed in vascular complications of diabetes, macrovascular 
disease, renal insufficiency, and Alzheimer's disease (Thornalley, PJ. (1998) Ceil Mol. Biol. (Ncrisy- 

15 Le-Grand) 44:1013-1023). 

Coenzyme A (CoA) transferase catalyzes the transfer of CoA between two carboxylic acids. 
Sucdnyl CoA:3-oxoacid CoA transferase, for example, transfers CoA from succinyl-CoA to a recipient 
such as acetoacetate. Acetoacetate is essential to the metabolism of ketone bodies, which accumulate in 
tissues affected by metabolic disorders such as diabetes (PROSITE: PDOC00980). 

20 The discovery of new human transferase molecules and the polynucleotides encoding them 

satisfies a need in the art by providing new compositions which are useful in the diagnosis, prevention, 
and treatment of cell proliferative disorders and immune system disorders, and in the assessment of the 
effects of exogenous compounds on the expression of nucleic acid and amino acid sequences of human 
transferase molecules. 

SUMMARY OF THE INVENTION 

The invention features purified polypeptides, human transferase molecules, referred to 
collectively as 4< HTFS" and individually as "HTFS-1 "HTFS-2," "HTFS-3," "HTFS-4," "HTFS-5," 
"HTFS-6," "HTFS-7," "HTFS-8," "HTFS-9," "HTFS-10," "HTFS-11," "HTFS-12," "HTFS-13," 
30 "HTFS-14," "HTFS-15," "HTFS-16," "HTFS-17," "HTFS-18," "HTFS-19," "HTFS-20," "HTFS- 
21," "HTFS-22," "HTFS-23," "HTFS-24," "HTFS-25," "HTFS-26," "HTFS-27," "HTFS-28," 
"HTFS-29," "HTFS-30," "HTFS-31," "HTFS-32," "HTFS-33," "HTFS-34," "HTFS-35," "HTFS- 
36," "HTFS-37," "HTFS-38," "HTFS-39," "HTFS-40," "HTFS-41," and "HTFS-42." In one aspect, 
the invention provides an isolated polypeptide comprising an amino acid sequence selected from the 
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group consisting of a) an amino acid sequence selected from the group consisting of SEQ ID NO: 1 -42, 
b) a naturally occurring amino acid sequence having at least 90% sequence identity to an amino acid 
sequence selected from the group consisting of SEQ ID NO:l-42, c) a biologically active fragment of 
an amino acid sequence selected from the group consisting of SEQ ID NO: 1-42, and d) .an 
5 immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:l- 
42. In one alternative, the invention provides an isolated polypeptide comprising the amino acid 
sequence of SEQ ID NO: 1-42. 

The invention further provides an isolated polynucleotide encoding a polypeptide comprising an 
amino acid sequence selected from the group consisting of a) an amino add sequence selected from the 

10 group consisting of SEQ ID NO:l-42, b) a naturally occurring amino acid sequence having at least 
90% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:l- 
42, c) a biologically active fragment of an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-42, and d) an immunogenic fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO:l-42. In one alternative, the polynucleotide encodes a polypeptide selected 

15 fromthegroup consisting of SEQ ID NO:l-42. In another alternative, the polynucleotide is selected 
from the group consisting of SEQ ID NO:43-84. 

Additionally, the invention provides a recombinant polynucleotide comprising a promoter 
sequence operably linked to a polynucleotide encoding a polypeptide comprising an amino acid 
sequence selected from the group consisting of a) an amino acid sequence selected from the group 

20 consisting of SEQ ID NO:l-42, b) a naturally occurring amino acid sequence having at least 90% 

sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-42, c) 
a biologically active fragment of ah amino acid sequence selected from the group consisting of SEQ ID 
NO: 1 -42, and d) an immunogenic fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO:l-42. In one alternative, the invention provides a cell transformed with the 

_25— recombinant polynucleotide. In another.alternatiye, the invention provides a transgenic orgamsm 

comprising the recombinant polynucleotide. 

The invention also provides a method for producing a polypeptide comprising an amino acid 
sequence selected from the group consisting of a) an amino add sequence selected from the group 
consisting of SEQ ID NO: 1-42, b) a naturally occurring amino acid sequence having at least 90% 

30 sequence identity to an amino add sequence selected from the group consisting of SEQ ID NO: 1 -42, c) 
a biologically active fragment of an amino add sequence sdected from the group consisting of SEQ ID 
NO: 1-42, and d) an immunogenic fragment of an amino add sequence selected from the group 
consisting of SEQ ID NO:l-42. The method comprises a) culturing a cell undo- conditions suitable for 
expression of the polypeptide, wherein said cell is transformed with a recombinant polynucleotide 
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comprising a promoter sequence operably linked to a polynucleotide encoding the polypeptide, and b) 
recovering the polypeptide so expressed 

Additionally, the invention provides an isolated antibody which specifically binds to a 
polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino acid 

5 sequence selected from the group consisting of SEQ ID NO:l-42, b) a naturally occurring amino acid 
sequence having at least 90% sequence identity to an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-42, c) a biologically active fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1 -42, and d) an immunogenic fragment of an amino acid 
sequence selected from the group consisting of SEQ ID NO:l-42. 

10 The invention further provides an isolated polynucleotide comprising a polynucleotide sequence 

selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of 
SEQ ID NO:43-84, b) a naturally occurring polynucleotide sequence having at least 90% sequence 
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:43-84, c) a 
polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b), and e) 

15 an RNA equivalent of a)-d). In one alternative, the polynucleotide comprises at least 60 contiguous 
nucleotides. 

Additionally, the invention provides a method for detecting a target polynucleotide in a sample, 
said target polynucleotide having a sequence of a polynucleotide comprising a polynucleotide sequence 
selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of 

20 SEQ ID NO:43-84, b) a naturally occurring polynucleotide sequence having at least 90% sequence 
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:43-84, c) a 
polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b), and c) 
an RNA equivalent of a)-d). The method comprises a) hybridizing the sample with a probe comprising 
at least 20 contiguous nucleotides comprising a sequence complementary to said target polynucleotide 

25 in the sample, .and which probe specifically hybridizes to said target polynucleotide, under conditions 

whereby a hybridization complex is formed between said probe and said target polynucleotide or 
fragments thereof, and b) (Meeting the presence or absence of said hybridization complex, and 
optionally, if present, the amount thereof. In one alternative, the probe comprises at least 60 contiguous 
nucleotides. 

30 The invention further provides a method for detecting a target polynucleotide in a sample, said 

target polynucleotide having a sequence of a polynucleotide comprising a polynucleotide sequence 
selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of 
SEQ ID NO:43-84, b) a naturally oexurring polynucleotide sequence having at least 90% sequence 
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:43-84, c) a 



5 
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polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b), and e) 
an RNA equivalent of a)-d). The method comprises a) amplifying said target polynucleotide or 
fragment thereof using polymerase chain reaction amplification, and b) detecting the presence or 
absence of said amplified target polynucleotide or fragment thereof, and, optionally, if present, the 
5 amount thereof. 

The invention further provides a composition comprising an effective amount of a polypeptide 
comprising an amino acid sequence selected from the group consisting of a) an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-42, b) a naturally occurring amino acid sequence 
having at least 90% sequence identity to an amino acid sequence selected from the group consisting of 

10 SEQ ID NO:l-42, c) a biologically active fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO:l-42, and d) an immunogenic fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-42, and a pharmaceutically acceptable excipient In one 
embodiment, the composition comprises an amino acid sequence selected from the group consisting of 
SEQ ID NO:l-42. The invention additionally provides a method of treating a disease or condition 

15 associated with decreased expression of functional HTFS, comprising administering to a patient in need 
of such treatment the compositioa 

The invention also provides a method for screening a compound for effectiveness as an 
agonist of a polypeptide comprising an amino acidi sequence selected from the group consisting of a) an 
amino acid sequence selected from the group consisting of SEQ ID NO:l-42, b) a naturally occurring 

20 amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the 
group consisting of SEQ ID NO:l -42, c) a biologically active fragment of an amino acid sequence 
selected from the group consisting of SEQ ID NO:l-42, and d) an immunogenic fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO:l-42. The method comprises a) 
exposing a sample comprising the polypeptide to a compound, and b) detecting agonist activity in the 

. — 25 -sample. In one alternative, the invention provides a composition comprising an agonist compound 

identified by the method and a pharmaceutically acceptable excipient. In another alternative, the 
invention provides a method of treating a disease or condition associated with decreased expression of 
functional HTFS, comprising administering to a patient in need of such treatment the composition. 
Additionally, the invention provides a method for screening a compound for effectiveness as 

30 an antagonist of a polypeptide comprising an amino acid sequence selected from the group consisting 
of a) an amino acid sequence selected from the group consisting of SEQ ID NO:l-42, b) a naturally 
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 
selected from the group consisting of SEQ ID NO:l-42, c) a biologically active fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-42, and d) an immunogenic fragment 

■ ■ 6 
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of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-42. The method 
comprises a) exposing a sample comprising the polypeptide to a compound, and b) detecting 
antagonist activity in the sample. In one alternative, the invention provides a composition comprising 
an antagonist compound identified by the method and a pharmaceutical^ acceptable excipient In 
5 another alternative, the invention provides a method of treating a disease or condition associated with 
overexpression of functional HTFS, comprising administering to a patient in need of such treatment 
the composition. 

The invention further provides a method of screening for a compound that specifically binds 
to a polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino 

10 acid sequence selected from the group consisting of SEQ ID NO:l-42, b) a naturally occurring amino 
acid sequence having at least 90% sequence identity to an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-42, c) a biologically active fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1 -42, and d) an immunogenic fragment of an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-42. The method comprises a) combining 

15 the polypeptide with at least one test compound under suitable conditions, and b) detecting binding 
of the polypeptide to the test compound, thereby identifying a compound that specifically binds to the 
polypeptide. 

The invention further provides a method of screening for a compound that modulates the 
activity of a polypeptide comprising an amino acid sequence selected from the group consisting of a) 

20 an amino acid sequence selected from the group consisting of SEQ ID NO: 1 -42, b) a naturally 
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-42, c) a biologically active fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-42, and d) an immunogenic 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-42. The 
—25 method comprises a) combining the polypeptide with at least one test compound under conditions 
permissive for the activity of the polypeptide, b) assessing the activity of the polypeptide in the 
presence of the test compound, and c) comparing the activity of the polypeptide in the presence of the 
test compound with the activity of the polypeptide in the absence of the test compound, wherein a 
change in the activity of the polypeptide in the presence of the test compound is indicative of a 

30 compound that modulates the activity of the polypeptide. 

The invention further provides a method for screening a compound for effectiveness in 
altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
sequence selected from the group consisting of SEQ ID NO:43-84, the method comprising a) 
exposing a sample comprising the target polynucleotide to a compound, and b) detecting altered 

35 expression of the target polynucleotide. 

7 _____ 
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The invention further provides a method for assessing toxicity of a test compound, said 
method comprising a) treating a biological sample containing nucleic acids with the test compound; 
b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20 
contiguous nucleotides of a polynucleotide comprising a polynucleotide sequence selected from the 

5 group consisting of i) a polynucleotide sequence selected from the group consisting of SEQ ID 

NO:43-84, ii) a naturally occurring polynucleotide sequence having at least 90% sequence identity to 
a polynucleotide sequence selected from the group consisting of SEQ ID NO:43-84, iii) a 
polynucleotide sequence complementary to i), iv) a polynucleotide sequence complementary to ii), 
and v) an RNA equivalent of i)-iv). Hybridization occurs under conditions whereby a specific 

10 hybridization complex is formed between said probe and a target polynucleotide in the biological 
sample, said target polynucleotide comprising a polynucleotide sequence selected from the group 
consisting of i) a polynucleotide sequence selected from the group consisting of SEQ ID NO:43-84, 
ii) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:43-84, iii) a 

15 polynucleotide sequence complementary to i), i v) a polynucleotide sequence complementary to ii), 
and v) an RNA equivalent of i)-iv). Alternatively, the target polynucleotide comprises a fragment of 
a polynucleotide sequence selected from the group consisting of i)-v) above; c) quantifying the 
amount of hybridization complex; and d) comparing the amount of hybridization complex in the 
treated biological sample with the amount of hybridization complex in an untreated biological 

20 sample, wherein a difference in the amount of hybridization complex in the treated biological sample 
is indicative of toxicity of the test compound, 

BRIEF DESCRIPTION OF THE TABLES j 

Table 1 shows polypeptide and nucleotide sequence identification numbers (SEQ ID NOs), 
25 clone identification numbers (clone IDs), cDNA libraries, and cDNA fragments used to assemble frill- \ 
length sequences encoding HTFS. 

Table 2 shows features of each polypeptide sequence, including potential motifs, homologous 
sequences, and methods, algorithms, and searchable databases Used for analysis of HTFS. 

Table 3 shows the tissue-specific expression patterns of each nucleic acid sequence as 
30 determined by northern analysis; diseases, disorders, or conditions associated with these tissues; and the 
vector into which each cDNA was cloned 

Table 4 describes the tissues used to construct the cDNA libraries from which cDNA clones 
encoding HTFS were isolated 

Table 5 shows the tools, programs, and algorithms used to analyze the polynucleotides and 
35 polypeptides of the invention, along with applicable descriptions, references, and threshold parameters. 
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DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is understood 
that this invention is not limited to the particular machines, materials and methods described, as these 
5 may vary. It is also to be understood that the terminology used herein is for the purpose of describing 
particular embodiments only, and is not intended to limit the scope of the present invention which will 
be limited only by the appended claims. 

It must be noted that as used herein and in the appended claims, the singular forms "a," "an," 
and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a 
10 reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a 
reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so 
forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same meanings 
as commonly understood by one of ordinary skill in the art to which this invention belongs. Although 

15 any machines, materials, and methods similar or equivalent to those described herein can be used to 
practice or test the present invention, the preferred machines, materials and methods are now described 
All publications mentioned herein are cited for the purpose of describing and disclosing the cell lines, 
protocols, reagents and vectors which are reported in the publications and which might be used in 
connection with the invention. Nothing herein is to be construed as an admission that the invention is 

20 not entitled to antedate such disclosure by virtue of prior invention. 
DEFINITIONS 

"HTFS" refers to the amino acid sequences of substantially purified HTFS obtained from any 
species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant 

25 - ~ The term "agonist" refers to a molecule which intensifies or mimics -the biological activity of 
HTFS. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 
compound or composition which modulates the activity of HTFS either by directly interacting with 
HTFS or by acting on components of the biological pathway in which HTFS participates. 

An "allelic variant" is an alto-native form of the gene encoding HTFS. Allelic variants may 

30 result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
polypeptides whose structure or function may or may not be altered. A gene may have none, one, or 
many allelic variants of its naturally occurring fona Common mutational changes which give rise to 
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 
Each of these types of changes may occur alone, or in combination with the others, one or more times in 
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a given sequence. 

"Altered" nucleic acid sequences encoding HTFS include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as HTFS or a 
polypeptide with at least one functional characteristic of HTFS. Included within this definition are 
5 polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe of 
the polynucleotide encoding HTFS, and improper or unexpected hybridization to allelic variants, with a 
locus other than the normal chromosomal locus for the polynucleotide sequence encoding HTFS. The 
encoded protein may also be "altered," and may contain deletions, insertions, or substitutions of amino 
acid residues which produce a silent change and result in a functionally equivalent HTFS. Deliberate 

10 amino acid substitutions may be made on the basis of similarity in polarity, charge, solubility, 

hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues, as long as the biological 
or immunological activity of HTFS is retained. For example, negatively charged amino acids may 
include aspartic acid and glutamic acid, and positively charged amino acids may include lysine and 
arginine. Amino acids with uncharged polar side chains having similar hydrophilicity values may 

15 include: asparagine and glutamine; and serine and threonine. Amino acids with uncharged side chains 
having similar hydrophilicity values may include: leucine, isoleucine, and valine; glycine and alanine; 
and phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic 

20 molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally occurring 
protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid sequence 
to the complete native amino acid sequence associated with the recited protein molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
Amplification is generally carried out using polymerase chain reaction (PCR) technologies well known 

25 in the art. .__ _ . .. 

_ The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity of 
HTFS. Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, small 
molecules, or any other compound or composition which modulates the activity of HTFS either by 
directly interacting with HTFS or by acting on components of the biological pathway in which HTFS 

30 participates. 

The tern "antibody" refers to intact immunoglobulin molecules as well as to fragments thereof, 
such as Fab, F(ab')2, and Fv fragments, which are capable of binding an epitopic determinant 
Antibodies that bind HTFS polypeptides can be prepared using intact polypeptides or using fragments 
containing small peptides of interest as the immunizing antigen. The polypeptide or oligopeptide used 
35 to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of RNA, or 
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synthesized chemically, and can be conjugated to a carrier protein if desired Commonly used carriers 
that are chemically coupled to peptides include bovine serum albumin, thyroglobulin, and keyhole 
limpet hemocyanin (KLH). The coupled peptide is thai used to immunize the animal. 

The term "antigenic determinant" refers to that region of a molecule (i.e„ an epitope) that 
5 makes contact with a particular antibody. When a protein or a fragment of a protein is used to 

immunize a host animal, numerous regions of the protein may induce the production of antibodies which 
bind specifically to antigenic determinants (particular regions or three-dimensional structures on the 
protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen used to 
elicit the immune response) for binding to an antibody. 

10 The term "antisense" refers to any composition capable of base-pairing with the "sense" 

(coding) strand of a specific nucleic acid sequence. Antisense compositions may include DNA; RNA; 
peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as 
phosphorothioates, mcthylphosphonatcs, or benzylphosphonates; oligonucleotides having modified 
sugar groups such as 2-methoxyethyl sugars or 2-methoxyethoxy sugars; or oligonucleotides having 

15 modified bases such as 5 -methyl cytosine, 2-deoxyuradl, or 7-deaza-2 -deoxyguanosine. Antisense 
molecules may be produced by any method including chemical synthesis or transcription. Once 
introduced into a cell, the complementary antisense molecule base-pairs with a naturally occurring 
nucleic acid sequence produced by the cell to form duplexes which block either transcription or 
translation. The designation "negative" or "minus" can refer to the antisense strand, and the 

20 designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule. 

The term "biologically active" refers to a protein having structural, regulatory, or biochemical 
functions of a naturally occurring molecule. Likewise* "immunologically active" or "immunogenic" 
refers to the capability of the natural, recombinant, or synthetic HIPS, or of any oligopeptide thereof, 
to induce a specific immune response in appropriate animals or cells and to bind with specific 

.25 ,_ antibodies. _ _____ _ _ . ... ... 

"Coixiplementary" describes the relationship betweai two single-stranded nucleic acid 
sequences that anneal by base-pairing. For example, S'-AGW pairs with its complement, 
3'-TCA-5\ 

A "composition comprising a given polynucleotide sequence" and a "composition comprising a 
30 given amino acid sequence" refer broadly to any composition contaiiiing the given polynucleotide or 
amino acid sequence. The composition may comprise a dry formulation or an aqueous solution. 
Compositions comprising polynucleotide sequences encoding HTFS or fragments of HTFS may be 
employed as hybridization probes. The probes may be stored in fireeze-dried form and may be 
associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be 
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deployed in an aqueous solution containing salts (e.g. f NaCl), detergents (e.g., sodium dodecyl sulfate; 
SDS), and other components (e.g., Denhardt's solution, dry milk, salmon sperm DNA, etc.). 

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to repeated 
DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (PE Biosystems, * 

5 Foster City CA) in the 5' and/or the 3' direction, and resequenced, or which has been assembled from 
one or more overlapping cDNA, EST, or genomic DNA fragments using a computer program for 
fragment assembly, such as the GEL VIEW fragment assembly system (GCG, Madison WI) or Phrap 
(University of Washington, Seattle WA). Some sequences have been both extended and assembled to 
produce the consensus sequence. 

10 "Conservative amino acid substitutions" are those substitutions that are predicted to least 

interfere with the properties of the original protein, i.e M the structure and especially the function of the 
protein is conserved and not significantly changed by such substitutions. The table below shows amino 
acids which may be substituted for an original amino acid in a protein and which are regarded as 
conservative amino acid substitutions. 

15 Original Residue Conservative Substitution 



20 



25 



30 



35 



Ala 


Gly, Ser 


Arg 


His, Lys 


Asn 


Asp, Gin, His 


Asp 


Asn, Glu 


Cys 


Ala, Ser 


Gin 


Asn, Glu, His 


Glu 


Asp, Gin, His 


Gly 


Ala 


His 


Asn, Arg, Gin, Glu 


lie 


Leu, Val 


j Leu 


lie, Val 


Lys 


Arg, Gin, Glu 


Met 


Leu, Dc 


Phe 


His, Met, Leu, Tip, Tyr 


Ser 


Cys, Thr 


Thr ' 


"""" ~ " Ser; Val ' 


Trp 


Phe, Tyr 


Tyr 


His, Phe, Tip 


Val 


lie. Leu. Thr 



Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide 
backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation, 
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of the 
side chain. 

40 A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 

absence of one or more amino acid residues or nucleotides. 
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The term "derivative" refers to a chemically modified polynucleotide or polypeptide. Chemical 
modifications of a polynucleotide sequence can include, for example, replacement of hydrogen by an 
alkyl, acyl, hydroxyl, or amino group. A derivative polynucleotide encodes a polypeptide which retains 
at least one biological or immunological function of the natural molecule. A derivative polypeptide is 
5 one modified by glycosylate, pegylation, or any similar process that retains at least one biological or 
immunological function of the polypeptide from which it was derived. 

A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a 
measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide. 

"A "fragment" is a unique portion of HTFS or the polynucleotide encoding HTFS which is 

10 identical in sequence to but shorter in length than the parent sequence. A fragment may comprise up 
to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For example, a 
fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A fragment 
used as a probe, prima-, antigen, therapeutic molecule, orfor other purposes, may be at least 5, 10, 
15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino acid 

15 residues in length. Fragments may be preferentially selected from certain regions of a molecule. For 
example, a polypeptide fragment may comprise a certain length of contiguous amino acids selected 
from the first 250 or 500 amino acids (or first 25% or 50% of a polypeptide) as shown in a certain 
defined sequence. Clearly these lengths are exemplary, and any length that is supported by the 
specification, including the Sequence Listing, tables, and figures, may be encompassed by the present 

20 embodiments. 

A fragment of SEQ ID NO;43-84 comprises a region of unique polynucleotide sequence that 
specifically identifies SEQ ID NO:43-84, for example, as distinct from any other sequence in the 
genome from which the fragment was obtained. A fragment of SEQ ID NO:43-84 is useful, for 
example, in hybridization and amplification technologies and in analogous methods that distinguish 
-25- SEQ ID NO:43-84 from relate polynucleotide sequences. Theprecise length of a fragment of SEQ 
ID NO:43-84 and the region of SEQ ID NO:43-84 to which the fragment corresponds are routinely 
determinable by one of ordinary skill in the art based on the intended purpose for the fragment 

A fragment of SEQ ID NO: 1 -42 is encoded by a fragment of SEQ ID NO:43-84. A fragment 
of SEQ ID NO: 1-42 comprises a region of unique amino acid sequence that specifically identifies 
30 SEQIDNO:l-42. Forexample, a fragment of SEQ ID NO: 1-42 is useful as an immunogenic peptide 
for the development of antibodies that specifically recognize SEQ ID NO:l-42. The precise length of 
a fragment of SEQ ID NO:l-42 and the region of SEQ ID NO:l-42 to which the fragment 
corresponds are routinely determinable by one of ordinary skill in the art based on the intended 
purpose for the fragment. 

35 A "full-length" polynucleotide sequence is one containing at least a translation initiation codon 
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(&g., methionine) followed by an open reading frame and a translation termination codoiL A "full- 
length" polynucleotide sequence encodes a "full-length" polypeptide sequence. 

"Homology" refers to sequence similarity or, interchangeably, sequence identity, between two 
or more polynucleotide sequences or two or more polypeptide sequences. 

5 The terms "percent identity" and "% identity," as applied to polynucleotide sequences, refer to 

the percentage of residue matches between at least two polynucleotide sequences aligned using a 
standardized algorithm Such an algorithm may insert, in a standardized and reproducible way, gaps in 
the sequences being compared in order to optimize alignment between two sequences, and therefore 
achieve a more meaningful comparison of the two sequences. 

10 Percent identity between polynucleotide sequences may be determined using the default 

parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3. 1 2e sequence 
alignment program. This program is part of the LASERGENE software package, a suite of molecular 
biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in Higgins, D.G. 
andP.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et aL (1992) CABIOS 8:189-191. 

15 For pairwise alignments of polynucleotide sequences, the default parameters are set as follows: 

Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue weight table is 
selected as the default Percent identity is reported by CLUSTAL V as the "percent similarity" between 
aligned polynucleotide sequences. 

Alternatively, a suite of commonly used and freely available sequence comparison algorithms is 

20 provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment Search 
Tool (BLAST) (Altschul; S.F. et al. (1990) J. Mol. Biol. 215:403-410), which is available from several 
sources, including the NCBI, Bethesda, MD, and on the Internet at 

http://www.ncbi.nlmnih.gov/BLAST/. The BLAST software suite includes various sequence analysis 
programs including "blastn," that is used to align a known polynucleotide sequence with other 

25 polynucleotide sequences from a variety of databases. Also avaflable is a tool called "BLAST_2 

Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
Sequences" can be accessed and used interactively at htrpy/ww.ncbi.nlmnih.gov/gorf0)12.htiiil. The 
"BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are commonly used with gap and other parameters set to default settings.^For example, to 
30 compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
2.0. 12 (April-21 -2000) set at default parameters. Such default parameters may be, for example: 

Matrix: BLOSUM62 

Reward for match: 1 

Penalty for mismatch: -2 
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Open Gap: 5 and Extension Gap: 2 penalties 
Gap x drop-off: 50 
Expect: 10 
Word Size: 11 
5 Filter: on 

Percent identity may be measured over the length of an entire defined sequence, for example, as 
defined by a particular SEQ ID number, or may be measured ova- a shorter length, for example, over 
the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at least 20, at 
least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous nucleotides. Such 

10 lengths are exemplary only, and it is understood that any fragment length supported by the sequences 
shown herein, in the tables, figures, or Sequence Listing, may be used to describe a length over which 
percentage identity may be measured 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes in 

15 a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid sequences 
that all encode substantially the same protein. 

The phrases percent identity" and "% identity." as applied to polypeptide sequences, refer to 
the percentage of residue matches between at least two polypeptide sequences aligned using a 
standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some alignment 

20 methods take into account conservative amino acid substitutions. Such conservative substitutions, 
explained in more detail above, generally preserve the charge and hydrcphobicity at the site of 
substitution, thus preserving the structure (and therefore Junction) of the polypeptide* 

Percent identity between polypeptide sequences may be determined using the default parameters 
of the CLUSTAL V algorithm as incorporated into the MEG ALIGN version 3.12e sequence alignment 

25 program (described and referenced above), for pairwise ^alignments of polypeptide sequences usihg _ 
CLUSTAL V, the default parameters are set as follows: Ktuple=l, gap penalty=3, window=5, and 
"diagonals saved"=5. The PAM250 matrix is selected as the default residue weight table. As with 
polynucleotide alignments, the percent identity is reported by CLUSTAL V as the "percent similarity" 
between aligned polypeptide sequence pairs. 

30 Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 

comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 2.0. 1 2 
(Apr-21-2000) with blastp set at default parameters. Such default parameters may be, for example: 
Matrix: BLOSUM62 

Open Gap: 11 and Extension Gap: 1 penalties 
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Gap x drop-off: 50 
Expect: 10 
Word Size: 3 
Filter: on 

5 Percent identity may be measured over the length of an entire defined polypeptide sequence, for 

example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for instance, 
a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 150 
contiguous residues. Such lengths are exemplary only, and it is understood that any fragment length 

10 supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be used to 
describe a length over which percentage identity may be measured 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 Kb to 10 Mb in size, and which contain all of the elements required for 
chromosome replication, segregation and maintenance. 

15 The term "humanized antibody" refers to an antibody molecule in which the amino acid 

sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary strand through base pairing under defined hybridization conditions. Specific 

20 hybridization is an indication that two nucleic acid sequences share a high degree of complementarity. 
Specific hybridization complexes form under permissive annealing conditions and remain hybridized 
after the "washing" step(s). The washing step(s) is particularly important in determining the stringency 
of the hybridization process, with more stringent conditions allowing less non-specific binding, i.e., 
binding between pairs of nucleic acid strands that are not perfectly matched Permissive conditions for 

25 annealing of nucleic acid sequences are routinely determinable by one of ordinary skill in the art and_ 
may be consistent among hybridization experiments, whereas wash conditions may be varied among 
experiments to achieve the desired stringency, and therefore hybridization specificity. Permissive 
annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about 1% (w/v) 
SDS, and about 1 00 ng/ml sheared, denatured salmon spam DNA. 

30 Generally, stringency of hybridization is expressed, in part, with reference to the temperature 

under which the wash step is carried out Such wash temperatures are typically selected to be about 
5°C to 20°C lower than the thermal mdting point (TJ for the specific sequence at a defined ionic 
strength and pH. The T m is the temperature (under defined ionic strength and pH) at which 50% of the 
target sequence hybridizes to a perfectly matched probe. An equation for calculating T m and conditions 
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* for nucleic acid hybridization are wdl known and can be found in Sambrook, J. et al., 1989, Molecular 
Cloning; A Laboratory Manual, 2 nd ed., vol. l-3 f Cold Spring Harbor Press, Plainview NY; specifically 
see volume 2, chapter 9, 

High stringency conditions for hybridization between polynucleotides of the present invention 
5 include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0. 1 % SDS, for 1 hour. 
Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC concentration may 
be varied from about 0. 1 to 2 x SSC, with SDS being present at about 0. 1 %. Typically, blocking 
reagents are used to block non-specific hybridization. Such blocking reagents include, for instance, 
sheared and denatured salmon sperm DNA at about 100-200 jig/ml. Organic solvent, such as 
10 formamide at a concentration of about 35-50% v/v, may also be used under particular circumstances, 
such as for RNA:DNA hybridizations. Useful variations on these wash conditions will be readily 
apparent to those of ordinary skill in the art. Hybridization, particularly under high stringency 
conditions, may be suggestive of evolutionary similarity between the nucleotides. Such similarity is 
strongly indicative of a similar role for the nucleotides and their encoded polypeptides. 
15 The term "hybridization complex" refers to a complex formed between two nucleic acid 

sequences by virtue of the formation of hydrogen bonds between complementary bases. A hybridization 
complex may be formed in solution (e.g., C 0 t or Rot analysis) or formed between one nucleic acid 
sequence present in solution and another nucleic acid sequence immobilized on a solid support (e.g. t 
paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate to which cells 
20 or their nucleic acids have been fixed). 

The words "insertion" and "addition" refer to changes in an amino acid or nucleotide sequence 
resulting in the addition of one or more amino acid residues or nucleotides, respectively. 

"Immune response" can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression of 
25" " various factcrere.g., cy^ which may affect. cellular 

and systemic defense systeins. 

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of HTFS which is 
capable of eliciting an immune response when introduced into a living organism, for example, a 
mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragment of 
30 HTFS which is useful in any of the antibody production methods disclosed herein or known in the art. 

The term "microarray" refers to an arrangement of a plurality of polynucleotides, polypeptides, 
or other chemical compounds on a substrate. 

The terms "element" and "array dement" refer to a polynucleotide, polypeptide, or other 
chemical compound having a unique and defined position on a microarray. 
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The term "modulate" refers to a change in the activity of HTFS. For example, modulation may 

cause an increase or a decrease in protein activity, binding characteristics, or any other biological, 

functional, or immunological properties of HTFS. 

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, 
5 polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or 

synthetic origin which may be single-stranded or double-stranded and may represent the sense or the 

antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-likemateial. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 

functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
10 linked to a coding sequence if the promoter affects the transcription or expression of the coding 

sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where 

necessary to join two protein coding regions, in the same reading frame. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 

comprises an oligonucleotide of at least about 5 nucleotides in length linked to a pqrtide backbone of 
15 amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. PNAs 

preferentially bind complementary single stranded DNA or RNA and stop transcript elongation, and 

may be pegylated to extend their lifespan in the cell. 

"Post-translational modification" of an HTFS may involve lipidation, glycosylation, 

phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in the 
20 art. These processes may occur synthetically or biochemically. Biochemical modifications will vary by 

cell type depending on the enzymatic milieu of HTFS. 

"Probe" refers to nucleic acid sequences encoding HTFS, their complements, or fragments 

thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are 

isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. Typical 
25 labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. "Primers" are 

short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target polynucleotide by 

complementary base-pairing. The primer may then be extended along the target DNA strand by a DNA 

polymerase enzyme. Prima- pairs can be used for amplification (and identification) of a nucleic acid 

sequence, &g., by the polymerase chain reaction (PCR). 
30 Probes and primers as used in the present invention typically comprise at least 15 contiguous 

nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 

be enployed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100. 

or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers may 

be considerably longer than these examples, and it is understood that any length supported by the 
35 specification, including the tables, figures, and Sequence Listing, may be used 
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Methods for preparing and using probes and primers are described in the references, for 
example Sambrook, J. ct al. (1989) Molecular Cloning: A Laboratory Manual . 2 nd ed., vol. 1-3, Cold 
Spring Harbor Press, Plainview NY; Ausubcl, F.M. et al. (1987) Current Protocols in Molecular 
Biology . Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis, M. et al. (1990) PCR 
5 Protocols. A Guide to Methods and Applications . Academic Press, San Diego C A, PCR primer pairs 
can be derived from a known sequence, for example, by using computer programs intended for that 
purpose such as Prima: (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge 
MA). 

Oligonucleotides for use as primers are selected using software known in the art for such 
10 purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000 
nucleotides from an input polynucleotide sequence of up to 32 kUobases. Similar primer selection 
programs have incorporated additional features for expanded capabilities. For example, the PrimOU 
primer selection program (available to the public from the Genome Center at University of Texas South 
15 West Medical Center, Dallas TX) is capable of choosing specific primers from megabase sequences 
and is thus useful for designing primers on a genome- wide scope. The Primer3 primer selection 
program (available to the public from the Whitehead Institute/MIT Center for Genome Research, 
Cambridge MA) allows the user to input a "misprinting library," in which sequences to avoid as primer 
binding sites are user-specified. Primer3 is useful, in particular, for the selection of oligonucleotides for 
20 microarrays. (The source code for the latter two primer selection programs may also be obtained from 
their respective sources and modified to meet the user's specific needs.) The PrimeGen program 
(available to the public from the UK Human Genome Mapping Project Resource Centre, Cambridge 

UK) designs primers based on multiple sequence alignments, thereby allowing selection of primers that j 
hybridize to either the most conserved or least conserved regions of aligned nucleic acid sequences. 1 

25 JJenc^ JMs pjogrm 

polynucleotide fragments. The oligonucleotides and polynucleotide fragments identified by any of the 
above selection methods are useful in hybridization technologies, for example, as PCR or sequencing 
primers, microarray dements, or specific probes to identify fully or partially complanentary 
polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are not limited to 

30 those described above. 

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 
that is made by an artificial combination of two or more otherwise separated segments of sequence. 
This artificial combination is often accomplished by chemical synthesis or, more commonly, by the 
artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques 
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such as those described in Sambrook, supra . The term recombinant includes nucleic acids that have 

been altered soldy by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a 

recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter sequence. 

Such a recombinant nucleic acid may be part of a vector that is used, for example, to transform a cell. 
5 Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 

vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 

expressed, inducing a protective immunological response in the mammal. 

A "regulatory dement" refers to a nucleic acid sequence usually derived from untranslated 

regions of a gene and includes enhancers, promoters, introns, and 5* and 3' untranslated regions (UTRs). 
10 Regulatory elements interact with host or viral proteins which control transcription, translation, or RNA 

stability. 

"Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid, 
amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
chemiluminescent, or chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and 
15 other moieties known in the art 

An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear 
sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of the 
nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 
instead of deoxyribose. 

20 The term "sample" is used in its broadest sense. A sample suspected of containing nucleic 

acids encoding HTFS, or fragments thereof, or HTFS itself, may comprise a bodily fluid; an extract 
from a cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA, or 
cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" and "specifically binding" refer to that interaction between a 

25 protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 

synthetic binding composition. The interaction is dependent upon the presence of a particular structure 
of the protein, e.g., the antigenic determinant or epitope, recognized by the binding molecule. For 
example, if an antibody is specific for epitope "A," the presence of a polypeptide comprising the epitope 
A, or the presence of free unlabeled A, in a reaction containing free labeled A and the antibody will 

30 reduce the amount of labeled A that binds to the antibody. 

The term "substantially purified" refers to nucleic acid or amino acid sequences that are 
removed from their natural environment and are isolated or separated, and are at least 60% free, 
preferably at least 75 % free, and most preferably at least 90% free from other components with which 
they are naturally associated. 
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A "substitution" refers to the replacement of one or more amino acid residues or nucleotides by 
different amino acid residues or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 

5 microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

A 'transcript image" refers to the collective pattern of gene expression by a particular cell type 
or tissue under given conditions at a given time. 

'Transformation" describes a process by which exogenous DNA is introduced into a recipient 

10 cell. Transformation may occur under natural or artificial conditions according to various methods well 
known in the art, and may rely on any known method for the insertion of foreign nucleic acid sequences 
into a prokaryotic or eukaryotic host cell. The method for transformation is selected based on the type 
of host cell being transformed and may include, but is not limited to, bacteriophage or viral infection, 
electroporation, heat shock, lipofection, and particle bombardment. The term 'transformed" cells 

15 includes stably transformed cells in which the inserted DNA is capable of replication either as an 

autonomously replicating plasmid or as part of the host chromosome, as well as transiently transformed 
cells which express the inserted DNA or RNA for limited periods of time. 

A "transgenic organism," as used herein, is any organism, including but not limited to 
animals and plants, in which one or more of the cells of the organism contains heterologous nucleic 

20 acid introduced by way of human intervention, such as by transgenic techniques well known in the 
art. The nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor 
of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with 
a recombinant virus. The term genetic manipulation does not include classical cross-breeding, or in 
vitro fertilization, but rather is directed to the introduction of a recombinant DNA molecule. The 

25 -transgenic organisms contemplated in accordance. with the present invention includebacteria, 
cyanobacteria, fungi, plants, and animals. The isolated DNA of the present invention can be 
introduced into the host by methods known in the art, for example infection, transfection, 
transformation or transconjugation. Techniques for transferring the DNA of the present invention 
into such organisms are widely known and provided in references such as Sambrook, J. et al. (1989), 

30 supra . 

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having at 
least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of the 
nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999) 
set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at least 
35 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 95% or at least 98% or greater 
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sequence identity over a certain defined length. A variant may be described as, for example, an "allelic" 
(as defined above), "splice," "species," or "polymorphic" variant A splice Variant may have significant 
identity to a reference molecule, but will generally have a greater or lesser number of polynucleotides 
due to alternative splicing of exons during mRNA processing. The corresponding polypeptide may 
5 possess additional functional domains or lack domains that are present in the reference molecule. 
Species variants are polynucleotide sequences that vary from one species to another. The resulting 
polypeptides generally will have significant amino acid identity relative to each other. A polymorphic 
variant is a variation in the polynucleotide sequence of a particular gene between individuals of a given 
species. Polymorphic variants also may encompass "single nucleotide polymorphisms" (SNPs) in 

10 which the polynucleotide sequence varies by one nucleotide base. The presence of SNPs may be 
indicative of, for example, a certain population, a disease state, or a propensity for a disease state. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having at 
least 40% sequence identity to the particular polypeptide sequence over a certain length of one of the 
polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999) 

15 set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at least 
60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98% or greater sequence 
identity over a certain defined length of one of the polypeptides. 
THE INVENTION 

The invention is based on the discovery of new human transferase molecules (HTFS), the 

20 polynucleotides encoding HTFS, and the use of these compositions for the diagnosis, treatment, or 
prevention of cell proliferative disorders and immune system disorders. 

Table 1 lists the Incyte clones used to assemble full length nucleotide sequences encoding 
HTFS. Columns 1 and 2 show the sequence identification numbers (SEQ ID NOs) of the polypeptide 
and nucleotide sequences, respectively. Column 3 shows the clone IDs of the Incyte clones in which 

25 nucleic acids encoding each HTFS were identified, and column4 shows the cDNA libraries from which 
these clones were isolated Column 5 shows Incyte clones and their corresponding cDNA libraries. 
Clones for which cDNA libraries are not indicated were derived from pooled cDNA libraries. In some 
cases, GenBank sequence identifiers are also shown in column 5. The Incyte clones and GenBank 
cDNA sequences, where indicated, in column 5 were used to assemble the consensus nucleotide 

30 sequence of each HTFS and are useful as fragments in hybridization technologies. 

The columns of Table 2 show various properties of each of the polypeptides of the invention: 
column 1 references the SEQ ID NO; column 2 shows the number of amino acid residues in each 
polypeptide; column 3 shows potential phosphorylation sites; column 4 shows potential glycosylation 
sites; column 5 shows the amino acid residues comprising signature sequences and motifs; column 6 

35 shows homologous sequences as identified by BLAST analysis along with relevant citations, all of 
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which are expressly incorporated by reference herein in their entirety; and column 7 shows analytical 
methods and in some cases, searchable databases to which the analytical methods were applied. The 
methods of column 7 were used to characterize each polypeptide through sequence homology and 
proton motifs. 

5 The columns of Table 3 show the tissue-specificity and diseases, disorders, or conditions 

associated with nucleotide sequences encoding HTFS. The first column of Table 3 lists the nucleotide 
SEQ ID NOs. Fragments of these nucleotide sequences are useful, for example, in hybridization or 
amplification technologies to identify SEQ ID NO:43-84 and to distinguish between SEQ ID NO:43- 
84 and related polynucleotide sequences. The polypeptides encoded by these fragments are useful, for 

10 example, as immunogenic peptides. Column 2 lists tissue categories which express HTFS as a fraction 
of total tissues expressing HTFS. Column 3 lists diseases, disorders, or conditions associated with 
those tissues expressing HTFS as a fraction of total tissues expressing HTFS. Column 4 lists the 
vectors used to subclone each cDNA library. 

The columns of Table 4 show descriptions of the tissues used to construct the cDNA libraries 

15 from which cDNA clones encoding HTFS were isolated Column 1 references the nucleotide SEQ ID 
NOs, column 2 shows the cDNA libraries from which these clones were isolated, and column 3 shows 
the tissue origins and other descriptive information relevant to the cDN A libraries in column 2. 

SEQ ID NO:44 maps to chromosome 1 within the interval from 170.1 to 186.4 centiMorgans. 
SEQ ID NO:46 maps to chromosome 1 1 within the interval from 58.2 to 59.5 centiMorgans. SEQ ID 

20 NO:48 maps to chromosome 1 1 within the interval from 67.4 to 70.9 centiMorgans. SEQ ID NO:49 
maps to chromosome 21 within the interval from 51.6 centiMorgans to the q-terminus. SEQ ID 
NO:52 maps to chromosome 3 within the interval from 63.3 to 77.4 centiMorgans. SEQ ID NO:59 
maps to chromosome 20 within the interval from 50.2 to 53.6 centiMorgans and to chromosome 12 
within the interval from 113.3 to 1 18.9 centiMorgans. SEQ ID NO:60 maps to chromosome 12 

25 within the interval from 62.7 to 70.6 centiMorgans. SEQ ID NO:62 maps to chromosome 1 1 within^ 
the interval from 62.5 to 70.9 centiMorgans. SEQ ID NO:68 maps to chromosome 1 1 within the 
interval from 70.9 to 72. 1 centiMorgans. SEQ ID NO:78 maps to chromosome 23 within the interval 
from 94.4 to 97.4 centiMorgans and to chromosome 2 within the interval from 272.5 centiMorgans to 
the q-terminus. SEQ JUD NO:85 maps to chromosome 5 within the interval from 5.5 to 21 .5 

30 centiMorgans, to chromosome 17 within the interval from 53.9 to 62.9 centiMorgans, and to 
chromosome 12 within the interval from 84.7 to 92.5 centiMorgans. SEQ ID NO:86 maps to 
chromosome 6 within the interval from 42.0 to 45.4 centiMorgans, to chromosome 1 1 within the 
interval from 58.2 to 59.5 centiMorgans, and to chromosome 16 within the interval from 88.1 to 92.6 
centiMorgans. 

35 The invention also encompasses HTFS variants. A preferred HTFS variant is one which has at 
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least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid sequence 
identity to the HTFS amino acid sequence, and which contains at least one functional or structural 
characteristic of HTFS. 

The invention also encompasses polynucleotides which encode HTFS. In a particular 
5 embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected from 
the group consisting of SEQ ID NO:43-84, which encodes HTFS. The polynucleotide sequences of 
SEQ ID NO:43-84, as presented in the Sequence Listing, embrace the equivalent RNA sequences, 
wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the sugar backbone 
is composed of ribose instead of deoxyribose. 

10 The invention also encompasses a variant of a polynucleotide sequence encoding HTFS. In 

particular, such a variant polynucleotide sequence will have at least about 70%, or alternatively at least 
about 85%, or even at least about 95% polynucleotide sequence identity to the polynucleotide sequence 
encoding HTFS. A particular aspect of the invention encompasses a variant of a polynucleotide 
sequence comprising a sequence selected from the group consisting of SEQ ID NO:43-84 which has at 

15 least about 70%, or alternatively at least about 85%, or even at least about 95% polynucleotide 

sequence identity to a nucleic acid sequence selected from the group consisting of SEQ ID NO:43-84. 
Any one of the polynucleotide variants described above can encode an amino acid sequence which 
contains at least one functional or structural characteristic of HTFS. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the genetic 

20 code, a multitude of polynucleotide sequences encoding HTFS, some bearing minimal similarity to the 
polynucleotide sequences of any known and naturally occurring gene, may be produced Thus, the 
invention contemplates each and every possible variation of polynucleotide sequence that could be made 
by selecting combinations based on possible codon choices. These combinations are made in 
accordance with the standard triplet genetic code as applied to the polynucleotide sequence of naturally 
,25 ^occurring HTFS, and all such variations are to be considered as being specifically disclosed. 

Although nucleotide sequences which encode HTFS and its variants are generally capable 
hybridizing to the nucleotide sequence of the naturally occurring HTFS under appropriately selected 
conditions of stringency, it may be advantageous to produce nucleotide sequences encoding HTFS or its 
derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally occurring 

30 codons. Codons may be selected to increase the rate at which expression of the peptide occurs in a 
particular prokaryotic or eukaryotic host in accordance with the frequency with which particular codons 
are utilized by the host Other reasons for substantially altering the nucleotide sequence encoding 
HTFS and its derivatives without altering the encoded amino acid sequences include the production of 
RNA transcripts having more desirable properties, such as a greater half-life, than transcripts produced 
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from the naturally occurring sequence. 

The invention also encompasses production of DNA sequences which encode HTFS and HTFS 
derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the synthetic 
sequence may be inserted into any of the many available expression vectors and cell systems using 
5 reagents well known in the art Moreover, synthetic chemistry may be used to introduce mutations into 
a sequence encoding HTFS or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
NO:43-84 and fragments thereof undo- various conditions of stringency. (See, ag., WaW, G.M. and 
10 S.L. Bo-go- (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 

152:507-5 1 1 .) Hybridization conditions, including annealing and wash conditions, are described in 
"Definitions." 

Methods for DNA sequencing are well known in the art and may be used to practice any of the 
embodiments of the invention. The methods may employ such enzymes as the Klenow fragment of 

15 DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (PE Biosystems, 
Foster City CA), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NJ), or 
combinations of polymerases and proofreading exonucleases such as those found in the ELONGASE 
amplification system (Life Technologies, Gailhersburg MD). Preferably, sequence preparation is 
automated with machines such as the MICROLAB 2200 liquid transfer system (Hamilton, Reno N V), 

20 PTC200 thermal cycler (MJ Research, Watertown MA) and ABI CATALYST 800 thermal cycler (PE 
Biosystems). Sequencing is then carried out using either the ABI 373 or 377 DNA sequencing system 
(PE Biosystems), the MEG AB ACE 1000 DNA sequencing system (Molecular Dynamics, Sunnyvale 
CA), or other systems known in the art. The resulting sequences are analyzed using a variety of 
algorithms which are well known in the art. (See, e.g., Ausubel, F.M. (1997) Short Protocols in 

25 Molecular Biology . John Wiley & Sons, New York NY, unit 7.7; Meyers, RA. (1995) Molecular 
Biology and Biotechnology . Wiley VCH. New York NY. pp. 856-853.) 

The nucleic acid sequences encoding HTFS may be extended utilizing a partial nucleotide 
sequence and employing various PCR-based methods known in the art to detect upstream sequences, 
such as promoters and regulatory elements. For example, one method which may be employed, 

30 restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic 
DNA.within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 2:318-322.) 
Another method, inverse PCR, uses primers that extend in divergent directions to amplify unknown 
sequence from a circularized template. The template is derived from restriction fragments comprising a 
known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids 



25 



WO 01/32888 



PCT/US00/30485 



Res. 16:8186.) A third method, capture PCR, involves PCR amplification of DNA fragments adjacent 
to known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, M. et al. 
(1991) PCR Methods Applic. 1:111-119.) In this method, multiple restriction enzyme digestions and 
ligations may be used to insert an engineer ed double-stranded sequence into a region of unknown 

5 sequence before performing PCR. Other methods which may be used to retrieve unknown sequences 
are known in the art (See, e.g. f Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-3060). 
Additionally, one may use PCR, nested primers, and PROMOTERFINDER libraries (Clontech, Palo 
Alto C A) to walk genomic DNA. This procedure avoids the need to screen libraries and is useful in 
finding intron/exon junctions. For all PCR-based methods, primers may be designed using 

10 commercially available software, such as OLIGO 4.06 Primer Analysis software (National Biosciences, 
Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in length, to have a 
GC content of about 50% or more, and to anneal to the template at temperatures of about 68°C to 
72°C. 

When screening for full-length cDNAs, it is preferable to use libraries that have been 
15 size-sdected to include larger cDNAs. In addition, random-primed libraries, which often include 

sequences containing the 5* regions of genes, are preferable for situations in which an oligo d(T) library 

does not yield a full-length cDN A. Genomic libraries may be useful for extension of sequence into 5* 

non-transcribed regulatory regions. • 

Capillary electrophoresis systems which are commercially available may be used to analyze the 
20 size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary 

sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 

specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 

emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 

software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, PE Biosystems), and the entire process 
25" "from loading of samples to computer analysis and electronic data display may be_computer contrcdled 

Capillary dectrophoresis is especially preferable for sequencing small DNA fragments which may be 

present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments thereof which 

encode HTFS may be cloned in recombinant DNA molecules that direct expression of HTFS, or 
30 fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy of 

the genetic code, other DNA sequences which encode substantially the same or a functionally equivalent 

amino acid sequence may be produced and used to express HTFS. 

The nucleotide sequences of the present invention can be engineered using methods generally 

known in the art in order to alter HTFS-encoding sequences for a variety of purposes including, but not 
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limited to, modification of the cloning, processing, and/or expression of the gene product DNA 
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic 
oligonucleotides may be used to engineer the nucleotide sequences. For example, oligonucleotide- 
mediated site-directed mutagenesis may be used to introduce mutations that create new restriction sites, 

5 alter glycosylation patterns, change codon preference, produce splice variants, and so forth. 

Hie nucleotides of the present invention may be subjected to DNA shuffling techniques such 
as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number 
5,837,458; Chang, C.-C et al. (1999) Nat Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat 
Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat Biotechnol. 14:315-319) to alter or 

10 improve the biological properties of HTFS, such as its biological or enzymatic activity or its ability to 
bind to other molecules or compounds. DNA shuffling is a process by which a library of gene 
variants is produced using PCR-mediated recombination of gene fragments. The library is then 
subjected to selection or screening procedures that identify those gene variants with the desired 
properties. These preferred variants may then be pooled and further subjected to recursive rounds of 

15 DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial" 

breeding and rapid molecular evolution. For example, fragments of a single gene containing random 
point mutations may be recombined, screened, and then reshuffled until the desired properties are 
optimized. Alternatively, fragments of a given gene may be recombined with fragments of 
homologous genes in the same gene family, either from the same or different species, thereby 

20 maximizing the genetic diversity of multiple naturally occurring genes in a directed and controllable 
manner. 

In another embodiment, sequences encoding HTFS may be synthesized, in whole or in part, 
using chemical methods well known in the art. (See, e.g. , Caruthers, M.H. et al. (1 980) Nucleic Acids 
Symp. Ser. 7:215-223; Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.) Alternatively, 

25 HTFS itself-or a fragment th^eof may be synthesized using chemical methods. For example, peptide 
synthesis can be performed using various solution-phase or solid-phase techniques. (See, e.g., 
Creighton, T. nQfttt Proteins. Structures and Molecular Properties. WH Freeman, New York NY, pp. 
55-60; and Roberge, J. Y. et al. (1995) Science 269:202-204.) Automated synthesis may be achieved 
using the ABI 43 1 A peptide synthesizer (PE Biosystems). AdditionaUy, the amino acid sequence of 

30 HTFS, or any part thereof, may be altered during direct synthesis and/or combined with sequences from 
other proteins, or any part thereof, to produce a variant polypeptide or a polypeptide having a sequence 
of a naturally occurring polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 
chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421.) 

35 The composition of the synthetic peptides may be confirmed by amino acid analysis or by sequencing. 
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(See, e.g., Creighton, supra , pp. 28-53.) 

In order to express a biologically active HTFS, the nucleotide sequences encoding HTFS or 
derivatives thereof may be inserted into an appropriate expression vector, i.&, a vector which contains 
the necessary elements for transcriptional and translational control of the inserted coding sequence in a 

5 suitable host These elements include regulatory sequences, such as enhancers, constitutive and 
inducible promoters, and 5 ' and 3* untranslated regions in the vector and in polynucleotide sequences 
encoding HTFS. Such elements may vary in their strength and specificity. Specific initiation signals 
may also be used to achieve more efficient translation of sequences encoding HTFS. Such signals 
include the ATG initiation codon and adjacent sequences, e.g. the Kozak sequence. In cases where 

10 sequences encoding HTFS and its initiation codon and upstream regulatory sequ ences are inserted into 
the appropriate expression vector, no additional transcriptional or translational control signals may be 
needed. However, in cases where only coding sequence, or a fragment thereof, is inserted, exogenous 
translational control signals including an in-frame ATG initiation codon should be provided by the 
vector. Exogenous translational elements and initiation codons may be of various origins, both natural 

15 and synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers appropriate 
for the particular host cell system used. (See, e.g., Scharf, D. et al. (1994) Results Probl. Cdl Differ. 
20:125-162.) 

Methods which are well known to those skilled in the art may be used to construct expression 
vectors containing sequences encoding HTFS and appropriate transcriptional and translational control 

20 elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in 
vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1 989) Molecular Cloning. A Laboratory 
Manual . Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubel, F.M. et al. (1995) 
Current Protocols in Molecular Biology , John Wiley & Sons, New York NY, ch. 9, 13, and 1 6.) 

A variety of expression vector/host systems may be utilized to contain and express sequences 

-25 _ encoding HTFS. JThese include,.but are.not hmit^ Jo, nuCTpprganisms such as bact^ tr^formed 

with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with 
yeast expression vectors; insect cdl systems infected with viral expression vectors (e.g., baculovirus); 
plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, or 
tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or 

30 animal cdl systems. (See, e.g., Sambrook, supra ; Ausubel, supra ; Van Heeke, G. and S.M. Schuster 
(1989) J. Biol. Chem. 264:5503-5509; Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; 
Scorer, C.A. et al. (1994) Bio/Technology 12:181-184; Engelhard, E.K. et al. (1994) Proc. Nad. 
Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. GeneTher. 7:1937-1945; Takamatsu, 
N. (1987) EMBO J. 6:307-311; Coruzzi, G. etal. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. 

35 (1984) Science 224:838-843; Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105; The 
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McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp. 
191-196; Logan, J. and T Shenk (1984) Proc. Natl. Acad. Sci. USA 81:3655-3659; and Harrington, 
J.J. et al. (1997) Nat. Genet 15:345-355.) Expression vectors derived from retroviruses, 
adenoviruses, or herpes or vaccinia viruses, or from various bacterial plasmids, may be used for 

5 delivery of nucleotide sequences to the targeted organ, tissue, or cell population. (See, e.g., Di 
Nicola, M. et al. (1998) Cancer Gen. Ther. 5(6):350-356; Yu, M. et al. (1993) Proc. Natl. Acad. Sci. 
USA 90(1 3):6340-6344; Bullcr, R.M et al. (1985) Nature 317(6040):813-815; McGregor, D.P. et al. 
(1994) Mol. Immunol. 31(3):219-226; and Verma, I.M. and N. Somia (1997) Nature 389:239-242.) 
The invention is not limited by the host cell employed. 

10 In bacterial systems, a number of cloning and expression vectors may be selected depending 

upon the use intended for polynucleotide sequences encoding HTFS. For example* routine cloning, 
subcloning, and propagation of polynucleotide sequences encoding HTFS can be achieved using a 
multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1 plasmid 
(Life Technologies). Ligation of sequences encoding HTFS into the vector's multiple cloning site 

1 5 disrupts the lacL gene, allowing a cplorimetric screening procedure for identification of transformed 
bacteria containing recombinant molecules. In addition, these vectors may be useful for in vitro 
transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of nested 
deletions in the cloned sequence. (See, c.g., Van Heeke, G. and S.M. Schuster (1989) J. Biol. Chem. 
264:5503-5509.) When large quantities of HTFS are needed, e.g. for the production of antibodies, 

20 vectors which direct high level expression of HTFS may be used. For example, vectors containing the 
strong, inducible T5 or T7 bacteriophage promoter may be used 

Yeast expression systems may be used for production of HTFS. A number of vectors 
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 
promoters, may be used in the yeast Saccharomvces cerevisiae or Pichia pastoris . In addition, such 

25 vectors direct either the secretion or intracellular retention of expressed proteins and enablejntegration 
of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubd, 1995, sues; 
Bitter, supra ; and Scorer, supra .) 

Plant systems may also be used for expression of HTFS. Transcription of sequences encoding 
HTFS may be driven viral promoters, e.g., the 35S and 19S promoters of CaMV used alone or in 

30 combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 6:307-31 1). 
Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock promoters may be 
used (See, e.g., Coruzzi, supra ; Broglie, supra ; and Winter, supra .) These constructs can be 
introduced into plant cells by direct DNA transformation or pathogen-mediated transfectioa (See, e.g„ 
The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp. 

35 191096.) 
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In mammalian cells, a number of viral-based expression systems may be utilized In cases 
where an adenovirus is used as an expression vector, sequences encoding HTFS may be ligated into an 
adenovirus transcription/translation complex consisting of the late promoter and tripartite leader 
sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to obtain , 

5 infective virus which expresses HTFS in host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc. 
Natl. Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma 
virus (RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 or EBV- 
based vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 

10 DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 1 0 Mb are 

constructed and delivered via conventional delivery methods (liposomes, polycationic amino polymers, 
or vesicles) for therapeutic purposes. (See, e.g., Harrington, J J. et al. (1997) Nat Genet. 15:345-355.) 

For long tain production of recombinant proteins in mammalian systems, stable expression of 
HTFS in cell lines is preferred. For example, sequences encoding HTFS can be transformed into cell 

15 lines using expression vectors which may contain viral origins of replication and/or endogenous 

expression elements and a selectable marker gene on the same or on a separate vector. Following the 
introduction of the vector, cells may be allowed to grow for about 1 to 2 days in enriched media before 
being switched to selective media. The purpose of the selectable marker is to confer resistance to a 
selective agent, and its presence allows growth and recovery of cells which successfully express the 

20 introduced sequences. Resistant clones of stably transformed cells may be propagated using tissue 
culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These include, 
but are not limited to, tfie herpes simplex virus thymidine kinase and adenine phosphoribosyltransferase 
genes, for use in tkr zn&apf cells, respectively. (See, e.g., Wigler, M. et aL (1977) Cell 11:223-232; 

25 Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, ^antibiotic, or herbicide resistance can be 
used as the basis for selection. For example, dhfr confers resistance to methotrexate; neo confers 
resistance to the aminoglycosides neomycin and G-418; and als and pat confer resistance to 
cMOTSulforon andphospWnotricin acetyltransfo-ase, respectively. (See, e.g., Wigler, M. et al. (1980) 
Proc. Natl. Acad Sci. US A 77:3567-3570; Colbere-Garapin, F. et al. (1981) J. Mol. Biol. 150:1-14.) 

30 Additional selectable genes have been described, e.g., trpB and hisD, which alter cellular requirements 
for metabolites. (See, e>g., Hartman, S.C. and R.C. Mulligan (1988) Proc. Natl. Acad. Sci. USA 
85:8047-805 1 .) Visible markers, e.g., anthocyanins, green fluorescent proteins (GFP; Clontech), B 
glucuronidase and its substrate B-glucuronide, or luciferase and its substrate luciferin may be used. 
These markers can be used not only to identify transformants, but also to quantify the amount of 
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transient or stable protein expression attributable to a specific vector system. (See, e.g., Rhodes, C.A 
(1995) Methods Mol. Biol. 55:121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of interest is 
also present, the presence and expression of the gene may need to be confirmed For example, if the 

5 sequence encoding HTFS is inserted within a marker gene sequence, transformed cells containing 
sequences encoding HTFS can be identified by the absence of marker gene function. Alternatively, a 
marker gene can be placed in tandem with a sequence encoding HTFS undo- the control of a single 
promoter. Expression of the marker gene in response to induction or selection usually indicates 
expression of the tandem gene as well. 

10 In general, host cells that contain the nucleic acid sequence encoding HTFS and that express 

HTFS may be identified by a variety of procedures known to those of skill in the art These procedures 
include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR amplification, and 
protein bioassay or immunoassay techniques which include membrane, solution, or chip based 
technologies for the detection and/or quantification of nucleic acid or protein sequences. 

15 Immunological methods for detecting and measuring the expression of HTFS using either 

specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques include 
enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and fluorescence 
activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal 
antibodies reactive to two non-interfering epitopes on HTFS is preferred, but a competitive binding 

20 assay may be employed. These and other assays are well known in the art (See, e g., Hampton, R. et 
al. (1990) Serological Meth ods, a Laboratory Manual. APS Press, SL Paul MN, Sect IV; Coligan, J.E. 
et al. (1997) Current Protocols in Immunology. Greene Pub. Associates and Wiley-Interscience, New 
York NY; and Pound, J.D. (1998) Immunochemical Protocols , Humana Press, Totowa NJ.) 

A wide variety of labels and conjugation techniques arc known by those skilled in the art and 

25 " may be used in various nucleic acid and amino acid assays. -Meam for produdng la^ed hybridization 
or PCR probes for detecting sequences related to polynucleotides encoding HTFS include oligolabeling, 
nick translation, end-labeling, or PCR amplification using a labeled nucleotide. Alternatively, the 
sequences encoding HTFS, or any fragments thereof, may be cloned into a vector for the production of 
an mRN A probe. Such vectors are known in the art, are commercially available, and may be used to 

30 synthesize RNA probes in vitro by addition of an appropriate RNA polymerase such as T7, T3, or SP6 
and labeled nucleotides. These procedures may be conducted using a variety of commercially available 
kits, such as those provided by Amersham Pharmacia Biotech, Promega (Madison WI), and US 
Biochemical. Suitable reporter molecules or labels which may be used for ease of detection include 
radionuclides, enzymes, fluorescent, chemfluminescent, or chromogenic agents, as well as substrates, 



WO 01/32888 



PCT/USOO/30485 



cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding HTFS may be cultured under 
conditions suitable for the expression and recovery of the protein from cell culture. The protein 
produced by a transformed cdl may be secreted or retained intracellularly depending on the sequence 

5 and/or the vector used As will be,understood by those of skill in the art, expression vectors containing 
polynucleotides which encode HTFS may be designed to contain signal sequences which direct secretion 
of HTFS through a prokaryotic or eukaryotic cell membrane. • 

In addition, a host cell strain may be chosen for its ability to modulate expression of the 
inserted sequences or to process the expressed protein in the desired fashion. Such modifications of the 

10 polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, phosphorylation, 
lipidation, and acylation. Post-translational processing which cleaves a << prepro ,t or "pro" form of the 
protein may also be used to specify protein targeting, folding, and/or activity. Different host cells 
which have specific cellular machinery and characteristic mechanisms for post-translational activities 
(e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the American Type Culture 

15 Collection (ATCC, Manassas VA) and may be chosen to ensure the correct modification and processing 
of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 
sequences encoding HTFS may be ligated to a heterologous sequence resulting in translation of a fusion 
protein in any of the aforementioned host systems. For example, a chimeric HTFS protein containing a 

20 heterologous moiety that can be recognized by a commercially available antibody may facilitate the 
screening of peptide libraries for inhibitors of HTFS activity. Heterologous protein and peptide 
moieties may also facilitate purification of fusion proteins using commercially available affinity 
matrices. Such moieties include, but are not limited to, glutathione S-transferasc (GST), maltose 
binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, c-myc, and 

25 _ hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their cognate fusion 
proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and metal -chelate resins, 
respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity purification of fusion 
proteins using commercially available monoclonal and polyclonal antibodies that specifically recognize 
these epitope tags. A fusion protein may also be engineered to contain a proteolytic cleavage site 

30 located between the HTFS encoding sequence and the heterologous protein sequence, so that HTFS may 
be cleaved away from the heterologous moiety following purification Methods for fusion protein 
expression and purification are discussed in Ausubel (1995, supra , ch. 10). A variety of commercially 
available kits may also be used to facilitate expression and purification of fusion proteins. 

In a further embodiment of the invention, synthesis of radiolabeled HTFS may be achieved in 
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vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These systems 
couple transcription and translation of protein-coding sequences operably associated with the T7, T3, or 
SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid precursor, for 
example, 35 S-methionine. 

5 HTFS of the present invention or fragments thereof may be used to screen for compounds that 

specifically bind to HTFS. At least one and up to a plurality of test compounds may be screened for 
specific binding to HTFS. Examples of test compounds include antibodies, oligonucleotides, proteins 
(e.g„ receptors), or small molecules. 

In one embodiment, the compound thus identified is closely related to the natural ligand of 

10 HTFS, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or a 
natural binding partner. (See, e.g., Coligan, J.E. et al. (1991) Current Protocols in Immunology 1(2): 
Chapter 5.) Similarly, the compound can be closely related to the naturai receptor to which HTFS 
binds, or to at least a fragment of the receptor, e.g., the ligand binding site. In either case, the 
compound can be rationally designed using known techniques. In one embodiment, screening for 

15 these compounds involves producing appropriate cells which express HTFS, either as a secreted 

protein or on the cell membrane. Preferred cells include cells from mammals, yeast, Drosophila , or E. 
coli . Cells expressing HTFS or cell membrane fractions which contain HTFS are then contacted with 
a test compound and binding, stimulation, or inhibition of activity of either HTFS or the compound is 
analyzed. 

20 An assay may simply test binding of a test compound to the polypeptide, wherein binding is 

detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. For example, 
the assay may comprise the steps of combining at least one test compound with HTFS, either in 
solution or affixed to a solid support, and detecting the binding of HTFS to the compound. 
Alternatively, the assay may detect or measure binding of a test compound in the presence of a 

25 labeled competitor. Additionally, the assay may be carried out using cell-free preparations, chemical 
libraries, or natural product mixtures, and the test compounds) may be free in solution or affixed to a 
solid support 

HTFS of the present invention or fragments thereof may be used to screen k for compounds that 
modulate the activity of HTFS. Such compounds may include agonists, antagonists, or partial or 

30 inverse agonists. In one embodiment, an assay is performed under conditions permissive for HTFS 
activity, wherein HTFS is combined with at least one test compound, and the activity of HTFS in the 
presence of a test compound is compared with the activity of HTFS in the absence of the test 
compound. A change in the activity of HTFS in the presence of the test compound is indicative of a 
compound that modulates the activity of HTFS. Alternatively, a test compound is combined with an 

35 in vitro or cell-free system comprising HTFS under conditions suitable for HTFS activity, and the 
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assay is performed. In either of these assays, a test compound which modulates the activity of HTFS 
may do so indirectly and need not come in direct contact with the test compound. At least one and up 
to a plurality of test compounds may be screened. 

In another embodiment, polynucleotides encoding HTFS or their mammalian homologs may 

5 be "knocked out" in an animal model system using homologous recombination in embryonic stem 
(ES) cells. Such techniques are well known in the art and are useful for the generation of animal 
models of human disease. (See, e.g., U.S. Patent No. 5,175,383 and U.S. Patent No. 5,767,337.) For 
example, mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the early mouse 
embryo and grown in culture. The ES cells are transformed with a vector containing the gene of 

10 interest disrupted by a marker gene, e.g M the neomycin phosphotransferase gene (neo; Capecchi, M.R. 
(1989) Science 244: 1288-1292). The vector integrates into the corresponding region of the host 
genome by homologous recombination. Alternatively, homologous recombination takes place using 
the Cre-loxP system to knockout a gene of interest in a tissue- or developmental stage-specific 
manner (Marth, J.D. (1996) Clin. Invest 97:1999-2002; Wagner, K.U. et al. (1997) Nucleic Acids 

15 Res. 25:4323-4330). Transformed ES cells are identified and microinjected into mouse cell 

blastocysts such as those from the C57BL/6 mouse strain. The blastocysts are surgically transferred 
topseudopregnant dams, and the resulting chimeric progeny are genotyped and bred to produce 
heterozygous or homozygous strains. Transgenic animals thus generated may be tested with potential 
therapeutic or toxic agents. 

20 Polynucleotides encoding HTFS may also be manipulated in vitro in ES cells derived from 

human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell 
lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages dilferentiate 
into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et aL 
(1998) Science 282:1 145-1147). 

25 Polynucleotides encoding HTFS can also be used to create "knockin" humanized animals 

(pigs) or transgenic animals (mice or rats) to model human disease.~With taockih"ta:hn616gy, a " 
region of a polynucleotide encoding HTFS is injected into animal ES cells, and the injected sequence 
integrates into the animal cell genome. Transformed cells are injected into blastulae, and the 
blastulae are implanted as described above. Transgenic progeny or inbred lines are studied and 

30 treated with potential pharmaceutical agents to obtain information on treatment of a human disease. 
Alternatively, a mammal inbred to overexpress HTFS, e.g., by secreting HTFS in its milk, may also 
serve as a convenient source of that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 4:55-74). 
THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 
35 between regions of HTFS and human transferase molecules. In addition, the expression of HTFS is 
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closely associated with proliferating tissues and inflammation. Therefore, HTFS appears to play a 
role in cell proliferative disorders and immune system disorders. In the treatment of disorders 
associated with increased HTFS expression or activity, it is desirable to decrease the expression or 
activity of HTFS. In the treatment of disorders associated with decreased HTFS expression or 

5 activity, it is desirable to increase the expression or activity of HTFS. 

Therefore, in one embodiment, HTFS or a fragment or derivative thereof may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of HTFS. Examples of such disorders include, but are not limited to, a cell proliferative 
disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed 

10 connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, 

polycythemia vera, psoriasis, primary thrombocythemia, and cancers including adenocarcinoma, 
leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of 
the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder, ganglia, 
. gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, 

15 salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; and an immune system disorder such 
as inflammation, actinic keratosis, acquired immunodeficiency syndrome (AIDS), Addison's disease, 
adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, 
arteriosclerosis, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, 
bronchitis, bursitis, cholecystitis, cirrhosis, contact dermatitis, Crohn's disease, atopic dermatitis, 

20 dermatomyositis, diabetes mellitus, emphysema, erythroblastosis fetalis, erythema nodosum, atrophic 
gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves* disease, Hashimoto's 
thyroiditis, paroxysmal nocturnal hemoglobinuria, hepatitis, hypereosinophilia, irritable bowel 
syndrome, episodic lymphopenia with lymphocytotoxins, mixed connective tissue disease (MCTD), 
multiple sclerosis, myasthenia gravis, myocardial or pericardial inflammation, myelofibrosis, 

25 osteoarthritis, osteoporosis, pancreatitis, polycythemia vera, polymyositis, psoriasis, Reiter's 

syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic 
lupus erythematosus, systemic sclerosis, primary thrombocythemia, thrombocytopenic purpura, 
ulcerative colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and 
extracorporeal circulation/trauma, and hematopoietic cancer including lymphoma, leukemia, and 

30 myeloma. 

In another embodiment, a vector capable of expressing HTFS or a fragment or derivative 
thereof may be administered to a subject to treat or prevent a disorder associated with decreased 
expression or activity of HTFS including, but not limited to, those described above. 

In a further embodiment, a composition comprising a substantially purified HTFS in 
35 conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat or prevent a 
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disorder associated with decreased expression or activity of HTFS including, but not limited to, those 
provided above. 

In still another embodiment, an agonist which modulates the activity of HTFS may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or activity 
5 of HTFS including, but not limited to, those listed above. 

In a further embodiment, an antagonist of HTFS may be administered to a subject to treat or 
prevent a disorder associated with increased expression or activity of HTFS. Examples of such 
disorders include, but are not limited to, those cell proliferative disorders and immune system disorders 
described above. In one aspect, an antibody which specifically binds HTFS may be used directly as an 
10 antagonist or indirectly as a targeting or delivery mechanism for bringing a pharmaceutical agent to 
cells or tissues which express HTFS. 

in an additional embodiment, a vector expressing the complement of the polynucleotide 
encoding HTFS may be administered to a subject to treat or prevent a disorder associated with 
increased expression or activity of HTFS including, but not limited to, those described above. 
15 In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary 

sequences, or vectors of the invention may be administered in combination with other appropriate 
therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made by 
one of ordinary skill in the art, according to conventional pharmaceutical principles. The combination 
of therapeutic agents may act synergistically to effect the treatment or prevention of the various 
20 disorders described above. Using this approach, one may be able to achieve therapeutic efficacy with 
lower dosages of each agent, thus reducing the potential for adverse side effects. 

An antagonist of HTFS may be produced using methods which are generally known in the art. 
In particular, purified HTFS may be used to produce antibodies or to screen libraries of pharmaceutical 
agents to identify those which specifically bind HTFS. Antibodies to HTFS may also be generated 
25 using methods that arc well known in the art. Such antibodies may include, but are not limited to, 
polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and fragments produced 
by a Fab expression library. Neutralizing antibodies (i.e., those which inhibit dimer formation) are 
generally preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, 
30 and others may be immunized by injection with HTFS or with any fragment or oligopeptide thereof 
which has immunogenic properties. Depending on the host species, various adjuvants may be used to 
increase immunological response. Such adjuvants include, but arc not limited to, Freund's, mineral gels 
such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in humans, BCG 
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(bacilli Calmette-Guerin) and Corvnebacterimn parvum are especially preferable. 

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to HTFS 
have an amino acid sequence consisting of at least about 5 amino acids, and generally will consist of at 
least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or fragments are 
5 identical* to a portion of the amino acid sequence of the natural protein. Short stretches of HTFS amino 
acids may be fused with those of another protein, such as KLH, and antibodies to the chimeric molecule 
may be produced. 

Monoclonal antibodies to HTFS may be prepared using any technique which provides for the 
production of antibody molecules by continuous cdl lines in culture. These include, but are not limited 
10 to, the hybridoma technique, the human B-cell hybridoma technique, and the EB V-hybridoma 
technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. 
Immunol. Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA 80:2026-2030; and 
Cole, S.P. et al. (1984) Mol Cdl Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
15 splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used (See, e.g., Morrison, S.L. et al. (1984) Proc. 
Natl. Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; andTakeda, 
S. et al. (1 985) Nature 314:452-454.) Alternatively, techniques described for the production of single 
chain antibodies may be adapted, using methods known in the art, to produce HTFS-specific single 
20 chain antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be 
generated by chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g„ Burton, 
D.R. (1991) Proc. Natl. Acad Sci. USA 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte population 
or by screening immunoglobulin libraries or panels of highly specific binding reagents as disclosed in 

25 - - the literature._(See,.e.g., Orlandi, R. et.aL (1989) Proc. Nad. Acad Sci. USA , 86:3833-3837; Winter, 

G. et al. (1991) Nature 349:293-299.) 

Antibody fragments which contain specific binding sites for HTFS may also be generated For 
example, such fragments include, but are not limited to, F(ab') 2 fragments produced by pepsin digestion 
of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of the F(ab')2 
30 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and easy 

identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D. et al. 
(1989) Science 246:12754281.) 

Various immunoassays may be used for screening to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
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polyclonal or monoclonal antibodies with established speciflcities are well known in the art. Such 
immunoassays typically involve the measurement of complex formation between HTFS and its specific 
antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive to two 
non-interfering HTFS epitopes is generally used, but a competitive binding assay may also be employed 
5 (Pound, supra) . 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay techniques 
may be used to assess the affinity of antibodies for HTFS. Affinity is expressed as an association 
constant, K^, which is defined as the molar concentration of HTFS-antibody complex divided by the 
molar concentrations of free antigen and free antibody undo- equilibrium conditions. The determined 

10 for a preparation of polyclonal antibodies, which are heterogeneous in their affinities for multiple HTFS 
epitopes, represents the average affinity, or avidity, of the antibodies for HTFS. The K* determined for 
a preparation of monoclonal antibodies, which are monospecific for a particular HTFS epitope, 
represents a true measure of affinity. High-affinity antibody preparations with K, ranging from about 
10 9 to 10 12 L/mole are preferred for use in immunoassays in which the HTFS-antibody complex must 

15 withstand rigorous manipulations. Low-affinity antibody preparations with ranging from about 10 6 
to 10 7 L/mole are preferred for use in immunopurification and similar procedures which ultimately 
require dissociation of HTFS, preferably in active form, from the antibody (Catty, D. (1988) 
Antibodies. Volume I: A Practical Approach . IRL Press, Washington DC; Lidddl, J.E. and A. Cryer 
(1991) A Practical Guide to Monoclonal Antibodies, John Wiley & Sons, New York NY). 

20 The titer and avidity of polyclonal antibody preparations may be further evaluated to determine 

the quality and suitability of such preparations for certain downstream applications. For example, a 
polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, preferably 5-10 mg 
specific antibody/ml, is generally employed in procedures requiring precipitation of HTFS-antibody 
complexes. Procedures for evaluating antibody specificity, titer, and avidity, and guidelines for 

25 "antibody quality and usage in various applications, are generally available. (See, e.g.* Catty, sugra, and_ 
Coligan et al., supra .) 

In another embodiment of the invention, the polynucleotides encoding HTFS, or any fragment 
or complement thereof, may be used for therapeutic purposes. In one aspect, modifications of gene 
expression can be achieved by designing complementary sequences or antisense molecules (DNA, RNA, 
30 PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene encoding HTFS. 
Such technology is well known in the art, and antisense oligonucleotides or larger fragments can be 
designed from various locations along the coding or control regions of sequences encoding HTFS. (See, 
e.g., Agrawal, S. f ed. (1996) Antisense Therapeutics . Humana Press Inc., Totawa NJ.) 

In therapeutic use, any gene delivery system suitable for introduction of the antisense 
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sequences into appropriate target cells can be used. Antisense sequences can be delivered 
intracellulaiiy in toe form of an expression plasmid which, upon transcription, produces a sequence 
complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g., 
Slater, J£. et al. (1998) J. Allergy Clin. Immunol. 102<3):469-475; and Scanlon, K.J. et al. (1995) 
5 9(13):1288-1296.) Antisense sequences can also be introduced intracellularly through the use of viral 
vectors, such as retrovirus and adeno-associated virus vectors. (See, e.g., Miller, A.D. (1990) Blood 
76:271; Ausubel, supra; Uckert, W. and W. Walther (1994) Pharmacol. Ther. 63(3):323-347.) Other 
gene delivery mechanisms include liposome-derived systems, artificial viral envelopes, and other 
systems known in the art. (See, e.g.. Rossi, J.J. (1995) Br. Med. Bull. 51(l):217-225; Boado, RJ. et 
10 al. (1998) J. Pharm. Sci. 87(1 1):1308-1 315 ; and Morris, MC. et al. (1997) Nucleic Acids Res. 
25(14):2730-2736.) 

In another embodiment of the invention, polynucleotides encoding HTFS may be used for 
somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic deficiency 
(e.g., in the cases of severe combined immunodeficiency (SCID)-X1 disease characterized by X-linked 

15 inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), severe combined 

immunodeficiency syndrome associated with an inherited adenosine deaminase (ADA) deficiency 
(Blaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:470-475). 
cystic fibrosis (Zabncr, J. et al. (1993) Cell 75:207-216; Crystal, R.G. et al. (1995) Hum. Gene 
Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum Gene Therapy 6:667-703), thalassamias, familial 

20 hypercholesterolemia, and hemophilia resulting from Factor VIII or Factor IX deficiencies (Crystal, 
R.G. (1995) Science 270:404-410; Verma, I.M. and N. Somia (1997) Nature 389:239-242)), (ii) 
express a conditionally lethal gene product (eg., in the case of cancers which result from unregulated 
cell proliferation), or (iii) express a protein which affords protection against intracellular parasites (e.g., 
against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D. (1988) 

25 Nature 335:395-396; Poeschla, E. et al. (1996) Proc. Natl. Acad. Sci. USA 93:11395-11399). 

hepatitis B or C virus (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides 
hrasiliensis : and protozoan parasites such as Plasmodium falciparum and Trypanosoma cruzi). In the 
case where a genetic deficiency in HTFS expression or regulation causes disease, the expression of 
HTFS from an appropriate population of transduced cells may alleviate the clinical manifestations 

30 caused by the genetic deficiency. 

In a further embodiment of the invention, diseases or disorders caused by deficiencies in HTFS 
are treated by constructing mammalian expression vectors encoding HTFS and introducing these 
vectors by mechanical means into HTFS-deficienl cells. Mechanical transfer technologies for use with 
cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) ballistic gold 

35 particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene transfer, and (v) the 

39 ; 
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use of DNA transposons (Morgan, R.A. and W.F. Anderson (1993) Annu. Rev. Biochem. 62:191-217; 
Ivies, Z. (1997) Cell 91:501-510; Boulay, J-L. and H. R&ipon (1998) Curr. Opin. Biotechnol. 9:445- 
450). 

Expression vectors that may be effective for the expression of HTFS include, but are not 

5 limited to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX vectors (Invitrogen, Carlsbad C A), 
PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA), and PTET-OFF, 
PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). HTFS may be expressed 
using (i) a constitutively active promoter, (e.g., from cytomegalovirus (CMV), Rous sarcoma virus 
(RSV), SV40 virus, thymidine kinase (TK), or p-actin genes), (ii) an inducible promoter (ag., the 

10 tetracyclinc-regulated promoter (Gossen, M. and H. Bujard (1992) Proc. Natl. Acad. Sci. USA 

89:5547-5551; Gossen, M. et aL (1995) Science 268:1766-1769; Rossi, F.M.V. and H.M Blau (1998) 
Curr. Opia Biotechnol. 9:451-456), commercially available intheT-REX plasmid (Invitrogen)); the 
ecdysone-inducible promoter (available in the plasmids PVGRXR and FIND; Invitrogen); the 
FK506/rapamycin inducible promoter; or the RU486/mifepristone inducible promote (Rossi, F.M. V. 

15 and H.M. Blau, supra) ), or (iii) a tissue-specific promoter or the native promoter of the endogenous 
gene encoding HTFS from a normal individual. 

Commercially available liposome transformation kits (e.g., the PERFECT LIPID 
TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 
polynucleotides to target cells in culture and require minimal effort to optimize experimental 

20 parameters. In the alternative, transformation is performed using the calcium phosphate method 
(Graham, F.L. and A.J. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. 
(1982)EMBOJ. 1:841-845). The introduction of DNA to primary cells requires modification of these 
standardized mammalian transfection protocols. 

In another embodiment of the invention, diseases or disorders caused by genetic defects with 

25 - respect to.HTFS expression are treated by. constructing a retrovirus vector consisting of (i) the 

polynucleotide encoding HTFS under the control of an independent promoter or the retrovirus long 
terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive 
element (RRE) along with additional retrovirus m-acting RNA sequences and coding sequences 
required for efficient vector propagation. Retrovirus vectors (eg., PFB and PFBNEO) are 

30 commercially available (Stratagene) and are based on published data (Riviere, I. et al. (1995) Proc 
Natl. Acad. Sci. USA 92:6733-6737), incorporated by reference herein The vector is propagated in an 
appropriate vector producing cell line (VPCL) that expresses an envelope gene with a tropism for 
receptors on the target cells or a promiscuous envelope protein such as VS Vg (Armentano, D, et al. 
(1987) J. Virol 61:1647-1650; Bender, MA. et al. (1987) J. Virol. 61:1639-1646; Adam, MA. and 
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A.D. Miller (1988) J. ViroL 62:3802-3806; Dull, T. et al. (1998) J, Virol. 72:8463-8471; Zufferey, R. 
et al. (1998) J. Virol. 72:9873-9880). U.S. Patent Number 5,910,434 to Rigg ("Method for obtaining 
retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant") discloses a 
method for obtaining retrovirus packaging cell lines and is hereby incorporated by reference. 
5 Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4* T-ceUs), and the 
return of transduced cells to a patient are procedures well known to pa-sons skilled in the art of gene 
therapy and have been well documented (Ranga, U. et al. (1997) J. Virol. 71:7020-7029; Bauer, G. et 
al. (1997) Blood 89:2259-2267; Bonyhadi, M.L. (1997) J. Virol. 71:4707^716; Ranga, U. al. 
(1998) Proc. Natl. Acad T Sci. USA 95:1201-1206; Su, L. (1997) Blood 89:2283-2290). 

10 In the alternative, an adenovirus-based gene therapy delivery system is used to deliver 

polynucleotides encoding HTFS to cells which have one or more genetic abnormalities with respect to 
the expression of HTFS. The construction and packaging of adenovirus-based vectors are well known 
to those with ordinary skill in the art. Replication defective adenovirus vectors have proven to be 
versatile for importing genes encoding immunoregulatory proteins into intact islets in the pancreas 

15 (Csete, M.E. et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are 
described in U.S. Patent Number 5,707,618 to Armentano ("Adenovirus vectors for gene therapy"), 
hereby incorporated by reference. For adenoviral vectors, see also Antinozzi, P.A et al. (1999) Annu. 
Rev. Nutr. 19:5 1 1-544; and Verma, I.M. and N. Somia (1997) Nature 18:389:239-242, both 
incorporated by reference herein. 

20 In another alternative* a herpes-based, gene therapy delivery system is used to deliver 

polynucleotides encoding HTFS to target cells which have one or more genetic abnormalities with 
respect to the expression of HTFS. The use of herpes simplex virus (HSV)-based vectors may be 
especially valuable for introducing HTFS to cells of the central nervous system, for which HS V has a 
tropism. The construction and packaging of herpes-based vectors are well known to those with 
'25 "* ordinary skill in the art- A rephcation-competent herpes simplex virus (HSV) type 1-based vector has^ 
been used to deliver a reporter gene to the eyes of primates (Liu, X. et al. (1999) Exp. Eye 
Res.l69:385-395). The construction of a HSV-1 virus vector has also been disclosed in detail in U.S. 
Patent Number 5,804,41 3 to DeLuca ("Herpes simplex virus strains for gene transfer"), which is 
hereby incorporated by reference. U.S. Patent Number 5,804,413 teaches the use of recombinant HSV 

30 d92 which consists of a genome containing at least one exogenous gene to be transferred to a cell under 
the control of the appropriate promoter for purposes including human gene therapy. Also taught by this 
patent are the construction and use of recombinant HSV strains deleted for ICP4, ICP27 and ICP22. 
For HSV vectors, see also Goins, W.F. et al. (1999) J. Virol. 73:519-532 and Xu. H. et al. (1994) Dev. 
Biol. 163:152-161, hereby incorporated by reference. The manipulation of cloned herpesvirus 
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sequences, the generation of recombinant virus following the transfection of multiple plasmids 
containing different segments of the large herpesvirus genomes, the growth and propagation of 
herpesvirus, and the infection of cells with herpesvirus arc techniques well known to those of ordinary 
skill in the art 

5 In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to 

deliver polynucleotides encoding HTFS to target cells. The biology of the prototypic alphavirus, 
Semliki Forest Virus (SFV), bas been studied extensively and gene transfer vectors have been based on 
the SFV genome (Garoff, H. and IC-J. Li (1998) Curr. Opin. Biotechnol. 9:464-469). During 
alphavirus RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid 

10 proteins. This subgenomic RNA replicates to higher levels than the full-length genomic RNA, resulting 
in the overproduction of capsid proteins relative to the viral proteins with enzymatic activity (eg., 
protease and polymerase). Similarly, inserting the coding sequence for HTFS into the alphavirus 
genome in place of the capsid-coding region results in the production of a large number of HTFS- 
coding RNAs and the synthesis of high levels of HTFS in vector transduced cells. While alphavirus 

15 infection is typically associated with cell lysis within a few days, the ability to establish a persistent 
infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN) indicates that 
the lytic replication of alphaviruses can be altered to suit the needs of the gene therapy application 
(Dryga, S.A. et al. (1997) Virology 228:74-83). The wide host range of alphaviruses will allow the 
introduction of HTFS into a variety of cell types. The specific transduction of a subset of cells in a 

20 population may require the sorting of cells prior to transductioa The methods of manipulating 
infectious cDNA clones of alphaviruses, performing alphavirus cDNA and RNA transfections, and 
performing alphavirus infections, are well known to those with ordinary skill in the art. 

Oligonucleotides derived from the transcription initiation site, e.g., between about positions -10 
and +10 from the start site, may also be employed to inhibit gene expression. Similarly, inhibition can 

25 be achieved using triple helix base-pairing m^hodology. Triple hdix pairing is useful because it causes 
inhibition of the ability of the double helix to open sufficiently for the binding of polymerases, 
transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have 
been described in the literature. (See, e.g., Gee, J.E. et al. (1994) in Huber, B E. and B.L Carr, 
Molecular and Immunologic Approaches . Futura Publishing, Mt Kisco NY, pp. 163-177.) A 

30 complementary sequence or antisense molecule may also be designed to block translation of mRNA by 
preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 
molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 
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engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of sequences encoding HTFS. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 

5 GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 
candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
oligonucleotides using ribonuclease protection assays. 

10 Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared by 

any method known in the art for the synthesis of nucleic acid molecules. These include techniques for 
chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. 
Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA sequences 
encoding HTFS. Such DNA sequences may be incorporated into a wide variety of vectors with suitable 

15 RNA polymerase promoters such asT7 or SP6. Alternatively, these cDNA constructs that synthesize 
complementary RNA, constitutively or inducibly, can be introduced into cell lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3'ends 
of the molecule, or the use of phosphorothioate or T O-methyl rather than phosphodiesterase linkages 

20 within the backbone of the molecule. This concept is inherent in the production of PNAs and can be 
extended in all of these molecules by the inclusion of nbntraditibnal bases such as inosine, queosine, and 
wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, 
guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases. 
An additional embodiment of the invention encompasses a method for screening for a 

25 --compound which is effective in altering expression of a polynucleotide encoding HTFS . .Compounds _ 
which maybe effective in altering expression of a specific polynucleotide may include, but are not 
limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming oligonucleotides, 
transcription factors and other polypeptide transcriptional regulators, and non-macromolecular 
chemical entities which are capable of interacting with specific polynucleotide sequences. Effective 

30 compounds may alter polynucleotide expression by acting as either inhibitors or promoters of 
polynucleotide expression. Thus, in the treatment of disorders associated with increased HTFS 
expression or activity, a compound which specifically inhibits expression of the polynucleotide 
encoding HTFS may be therapeutically useful, and in the treament of disorders associated with 
decreased HTFS expression or activity, a compound which specifically promotes expression of the 

35 polynucleotide encoding HTFS may be therapeutically useful. 
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At least one, and up to a plurality, of test compounds may be screened for effectiveness in 
altering expression of a specific polynucleotide. A test compound may be obtained by any method 
commonly known in the art, including chemical modification of a compound known to be effective in 
altering polynucleotide expression; selection from an existing, commercially-available or proprietary 

5 library of naturally-occurring or non-natural chemical compounds; rational design of a compound 
based on chemical and/or structural properties of the target polynucleotide; and selection from a 
library of chemical compounds created combinatorial^ or randomly. A sample comprising a 
polynucleotide encoding HTFS is exposed to at least one test compound thus obtained. The sample 
may comprise, for example, an intact or permeabilized cell, or an in vitro cell-free or reconstituted 

10 biochemical system. Alterations in the expression of a polynucleotide encoding HTFS are assayed by 
any method commonly known in the art Typically, the expression of a specific nucleotide is detected 
by hybridization with a probe having a nucleotide sequence complementary to the sequence of the 
polynucleotide encoding HTFS. The amount of hybridization may be quantified, thus forming the 
basis for a comparison of the expression of the polynucleotide both with and without exposure to one 

15 or more test compounds. Detection of a change in the expression of a polynucleotide exposed to a 
test compound indicates that the test compound is effective in altering the expression of the 
polynucleotide. A screen for a compound effective in altering expression of a specific polynucleotide 
can be carried out, for example, using a Schizosaccharomvces pombe gene expression system (Atkins, 
D. et al. (1999) U.S. Patent No. 5,932,435; Aindt, G.M. et al. (2000) Nucleic Acids Res. 28:E15) or a 

20 human cell line such as HeLa cell (Clarke, M.L. et al. (2000) Biochem. Biophys. Res. Commua 
268 :8-l 3). A particular embodiment of the present invention involves screening a combinatorial 
library of oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide nucleic acids, and 
modified oligonucleotides) for antisense activity against a specific polynucleotide sequence (Bmice, 
T.W. et al. (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S. Patent No. 6,022,691). 

25 Many methods for introducing vectors into cells or tissues are available and equally suitable for 

use in vivo , in vitro, and ex vivo . For ex vivo therapy, vectors may be introduced into stem cells taken 
from the patient and clonally propagated for autologous transplant back into that same patient 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 
using methods which are well known in the art. (See, e.g„ Goldman, CMC et al. (1997) Nat. 

30 Biotechnol. 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of such 
therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
monkeys. 

An additional embodiment of the invention relates to the administration of a composition which 
35 generally comprises an active ingredient formulated with a pharmaceutical^ acceptable excipient 



WO 01/32888 



PCT/USOO/30485 



Excipients may include, for example, sugars, starches, celluloses, gums, and proteins. Various 
formulations are commonly known and are thoroughly discussed in the latest edition of Remington's 
Pharmaceutical Sciences (Maack Publishing, Easton PA). Such compositions may consist of HTFS, 
antibodies to HTFS, and mimetics, agonists, antagonists, or inhibitors of HTFS. 

5 The compositions utilized in this invention may be administered by any number of routes 

including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, 
intraventricular, pulmonary, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, 
sublingual, or rectal means. 

Compositions for pulmonary administration may be prepared in liquid or dry powder form. 

10 These compositions are generally aerosolized immediately prior to inhalation by the patient In the case 
of small molecules (e.g. traditional low molecular weight organic drugs), aerosol delivery of fast-acting 
formulations is well-known in the art In the case of macromolecules (e.g. larger peptides and proteins), 
recent developments in the field of pulmonary delivery via the alveolar region of the lung have enabled 
the practical delivery of drugs such as insulin to blood circulation (see, e.g., Patton, J.S. et al., U.S. 

15 Patent No. 5,997,848). Pulmonary delivery has the advantage of administration without needle 
injection, and obviates the need for potentially toxic penetration enhancers. 

Compositioas suitable for use in the invention include compositions wherein the active 
ingredients are contained in an effective amount to achieve the intended purpose. The determination of 
an effective dose is well within the capability of those skilled in the art 

20 Specialized forms of compositions may be prepared for direct intraceJlutar delivery of 

macromolecules comprising HTFS or fragments thereof. For example, liposome preparations 
containing a cell-impermeable macromolecule may promote cell fusion and intracellular delivery of the 
macromolecule. Alternatively, HTFS or a fragment thereof may be joined to a short cationic N- 
terminal portion from the HIV Tat-1 protein Fusion proteins thus generated have been found to 

25 _ transduce into the cals of all tissues,.including the brain, in a ra>use mocM system (Schw - 

al. (1999) Science 285:1569-1572). 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, monkeys, 
or pigs. An animal model may also be used to determine the appropriate concentration range and route 
30 of administration. Such information can then be used to determine useful doses and routes for 
administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example HTFS 
or fragments thereof, antibodies of HTFS, ami agonists, antagonists or inhibitors of HTFS, which 
ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined by 
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standard pharmaceutical procedures in cell cultures or with experimental animals, such as by 
calculating the ED* (the dose therapeutically effective in 50% of the population) or .LD* (the dose 
lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the 
therapeutic index, which can be expressed as the UVED50 ratio. Compositions which exhibit large 

5 therapeutic indices are preferred The data obtained from cell culture assays and animal studies are 
used to formulate a range of dosage for human use. The dosage contained in such compositions is 
preferably within a range of circulating concentrations that includes the ED 50 with little or no toxicity. 
The dosage varies within this range depending upon the dosage form employed, the sensitivity of the 
patient, and the route of administratioa 

10 The exact dosage will be determined by the practitioner, in light of factors related to the subject 

requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the active 
moiety or to maintain the desired effect Factors which may be taken into account include the severity 
of the disease state, the general health of the subject, the age, weight, and gender of the subject, time 
and frequency of administration, drug combination^), reaction sensitivities, and response to therapy. 

15 Long-acting compositions may be administered every 3 to 4 days, every week, or biweekly depending 
on the half-life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from about 0. 1 //g to 100,000 ^g, up to a total dose of 
about 1 gram, depending upon the route of administratioa Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art 

20 Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc. 
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind HTFS may be used for the diagnosis 
25 of disorders characterized by expression of HTFS, or Jn assays to momtor.patients.being. treated with 
HTFS or agonists, antagonists, or inhibitors of HTFS. Antibodies useful for diagnostic purposes may 
be prepared in the same manner as described above for therapeutics. Diagnostic assays for HTFS 
include methods which utilize the antibody and a label to detect HTFS in human body fluids or in 
extracts of cells or tissues, the antibodies may be used with or without modification, and may be 
30 labeled by covalent or non-covalent attachment of a reporter molecule. A wide variety of reporter 
molecules, several of which are described above, are known in the art and may be used. 

A variety of protocols for measuring HTFS, including ELISAs, RIAs, and FACS, are known in 
the art and provide a basis for diagnosing altered or abnormal levels of HTFS expression. Normal or 
standard values for HTFS expression are established by combining body fluids or cell extracts taken 
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from normal mammalian subjects, for ©cample, human subjects, with antibody to HTFS under 
conditions suitable for complex formation. The amount of standard complex formation may be 
quantitated by various methods, such as photometric means. Quantities of HTFS expressed in subject, 
control, and disease samples from biopsied tissues are compared with the standard values. Deviation 

5 between standard and subject values establishes the parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding HTFS may be used for 
diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences, 
complementary RNA and DN A molecules, and PNAs. The polynucleotides may be used to detect and 
quantify gene expression in biopsied tissues in which expression of HTFS may be correlated with 

10 disease. The diagnostic assay may be used to determine absence, presence, and excess expression of 
HTFS, and to monitor regulation of HTFS levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide 
sequences, including genomic sequences, encoding HTFS or closely related molecules may be used to 
identify nucleic acid sequences which encode HTFS. The specificity of the probe, whether it is made 

15 from a highly specific region, e.g., the 5'regulatory region, or from a less specific region, e.g., a 
conserved motif, and the stringency of the hybridization or amplification will determine whether the 
probe identifies only naturally occuiring sequences encoding HTFS, allelic variants, or related 
sequences. 

Probes may also be used for the detection of related sequences, and may have at least 50% 

20 sequence identity to any of the HTFS encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:43-84 or from 
genomic sequences including promoters, enhancers, and introns of the HTFS gene. 

Means for producing specific hybridization probes for DN As encoding HTFS include the 
cloning of polynucleotide sequences encoding HTFS or HTFS derivatives into vectors for the 

25 production of mRNA probes. Such vectors, are known in the art, are commercially available, and may - 
be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a variety 
of reporter groups, for example, by radionuclides such as *P or 33 S, or by enzymatic labels, such as 
alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 

30 Polynucleotide sequences encoding HTFS may be used for the diagnosis of disorders associated 

with expression of HTFS. Examples of such disorders include, but are not limited to, a cell 
proliferative disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, 
hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal 
hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers including 

35 adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in 
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particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall 
bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, 
penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; and an immune 
system disorder such as inflammation, actinic keratosis, acquired immunodeficiency syndrome 

5 (AIDS), Addison's disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, 
amyloidosis, anemia, arteriosclerosis, asthma, atherosclerosis, autoimmune hemolytic anemia, 
autoimmune thyroiditis, bronchitis, bursitis, cholecystitis, cirrhosis, contact dermatitis, Crohn's 
disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, erythroblastosis fetalis, 
erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' 

10 disease, Hashimoto* s thyroiditis, paroxysmal nocturnal hemoglobinuria, hepatitis, hypereosinophilia, 
irritable bowel syndrome, episodic lymphopenia with lymphocytotoxins, mixed connective tissue 
disease (MCTD), multiple sclerosis, myasthenia gravis, myocardial or pericardial inflammation, 
myelofibrosis, osteoarthritis, osteoporosis, pancreatitis, polycythemia vera, polymyositis, psoriasis, 
Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, 

15 systemic lupus erythematosus, systemic sclerosis, primary thrombocythemia, thrombocytopenic 
purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and 
extracorporeal circulation, trauma, and hematopoietic cancer including lymphoma, leukemia, and 
myeloma. The polynucleotide sequences encoding HTFS may be used in Southern or northern analysis, 
dot blot, or other membrane-based technologies; in PCR technologies; in dipstick, pin, and multiformat 

20 ELIS A-like assays; and in microarrays utilizing fluids or tissues from patients to detect altered HTFS 
expression. Such qualitative or quantitative methods are well known in the art 

In a particular aspect, the nucleotide sequences encoding HTFS may be useful in assays that 
detect the presence of associated disorders, particularly those mentioned above. The nucleotide 
sequences encoding HTFS may be labeled by standard methods and added to a fluid or tissue sample 

25 from a patient under conditions suitable for the formation of hybridization complexes. After a suitable 

incubation period! the sample is washed and the signal is quantified and compared with a standard 

value. If the amount of signal in the patient sample is significantly altered in comparison to a control 
sample then the presence of altered levels of nucleotide sequences encoding HTFS in the sample 
indicates the presence of the associated disorder. Such assays may also be used to evaluate the efficacy 

30 of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to monitor the 
treatment of an individual patient 

In order to provide a basis for the diagnosis of a disorder associated with expression of HTFS, 
a normal or standard profile for expression is established. This may be accomplished by combining 
body fluids or cell extracts taken from normal subjects, either animal or human, with a sequence, or a 

35 fragment thereof, encoding HTFS, under conditions suitable for hybridization or amplificatioa 
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Standard hybridization may be quantified by comparing the values obtained from normal subjects with 
values from an experiment in which a known amount of a substantially purified polynucleotide is used 
Standard values obtained in this manner may be compared with values obtained from samples from 
patients who are symptomatic for a disorder. Deviation from standard values is used to establish the 

5 presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in the 
patient begins to approximate that which is observed in the normal subject. The results obtained from 
successive assays may be used to show the efficacy of treatment over a period ranging from several 

10 days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the development 
of the disease, or may provide a means for detecting the disease prior to the appearance of actual 
clinical symptoms. A more definitive diagnosis of this type may allow health professionals to employ 

15 preventative measures or aggressive treatment earlier thereby preventing the development or further 
progression of the cancer. 

Additional diagnostic uses far oligonucleotides designed from the sequences encoding HTFS 
may involve the use of PCR. These oligomers may be chemically synthesized, generated enzymatically, 
or produced in vitro . Oligomers will preferably contain a fragment of a polynucleotide encoding HTFS, 

20 or a fragment of a polynucleotide complementary to the polynucleotide encoding HTFS, and will be 
employed under optimized conditions for identification of a specific gene or condition. Oligomers may 
also be employed under less stringent conditions for detection or quantification of closely related DNA 
or RNA sequences. 

in a particular aspect, oligonucleotide primers derived from the polynucleotide sequences 

25 _ encoding HTFS may be used to <^ea single nucleotide polymorphisms (SNPs). SNPs are - - 

substitutions, insertions and deletions that are a frequent cause of inherited or acquired genetic disease 
in humans. Methods of SNP detection include, but are not limited to, single-stranded conformation 
polymorphism (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers 
derived from the polynucleotide sequences encoding HTFS are used to amplify DNA using the 
30 polymerase chain reaction (PCR). The DNA may be derived, for example, from diseased or normal 
tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause differences in the secondary 
and tertiary structures of PCR products in single-stranded form, and these differences are detectable 
using gel electrophoresis in non-denaturing gels, in fSCCP, the oligonucleotide primers are 
fluorescently labeled, which allows detection of the amplimers in high-throughput equipment such as 
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DNA sequencing machines. Additionally, sequence database analysis methods, termed in silico SNP 
(isSNP), are capable of identifying polymorphisms by comparing the sequence of individual 
overlapping DNA fragments which assemble into a common consensus sequence. These computer- 
based methods filter out sequence variations due to laboratory preparation of DNA and sequencing 

5 errors using statistical models and automated analyses of DNA sequence chromatograms. In the 

alternative, SNPs may be detected and characterized by mass spectrometry using, for example, the high 
throughput MASSARRAY system (Sequenom, Inc., San Diego CA). 

Methods which may also be used to quantify the expression of HTFS include radiolabeling or 
biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 

10 standard curves. (See, e.g„ Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. et 
al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be 
accelerated by running the assay in a high-throughput format where the oligomer or polynucleotide of 
interest is presented in various dilutions and a spectrophotometric or colorimetric response gives rapid 
quantitation. 

15 In further embodiments, oligonucleotides or longer fragments derived from any of the 

polynucleotide sequences described herein may be used as elements on a microarray. The microarray 
can be used in transcript imaging techniques which monitor the relative expression levels of large 
numbers of genes simultaneously as described in Seilhamer, J J. et al., "Comparative Gene Transcript 
Analysis," U.S. Patent No. 5,840,484, incorporated herein by reference. The microarray may also be 

20 used to identify genetic variants, mutations, and polymorphisms. This information may be used to 

determine gene function, to understand the genetic basis of a disorder, to diagnose a disorder, to monitor 
progression/regression of disease as a function of gene expression, and to develop and monitor the 
activities of therapeutic agents in the treatment of disease. In particular, this information may be used 
to develop a pharmacogenomic profile of a patient in order to select the most appropriate and effective 

25 . treatment regimen for that patient For example, therapeutic agents which are highly effective and 

display the fewest side effects may be selected for a patient based on his/her pharmacogenomic profile. 

In another embodiment, antibodies specific for HTFS, or HTFS or fragments thereof may be 
used as dements on a microarray. The microarray may be used to monitor or measure protein-protein 
interactions, drug-target interactions, and gene expression profiles, as described above 

30 A particular embodiment relates to the use of the polynucleotides of the present invention to 

generate a transcript image of a tissue or cell type. A transcript image represents the global pattern of 
gene expression by a particular tissue or cell type. Global gene expression patterns are analyzed by 
quantifying the number of expressed genes and Ihdr relative abundance under given conditions and at a 
given time. (See Seilhamer et al., "Comparative Gene Transcript Analysis,** U.S. Patent Number 
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5,840,484, expressly incorporated by reference hereia) Thus a transcript image may be generated by 
hybridizing the polynucleotides of the present invention or their complements to the totality of 
transcripts or reverse transcripts of a particular tissue or cell type. In one embodiment, the 
hybridization takes place in high-throughput format, wherein the polynucleotides of the present 

5 invention or their complements comprise a subset of a plurality of elements on a microarxay. The 
resultant transcript image would provide a profile of gene activity. 

Transcript images may be generated using transcripts isolated from tissues, cell lines, biopsies, 
or other biological samples. The transcript image may thus reflect gene expression in vivo , as in the 
case of a tissue or biopsy sample, or in vitro , as in the case of a cell line. 

10 Transcript images which profile the expression of the polynucleotides of the present invention 

may also be used in conjunction with in vitro model systems and preclinical evaluation of 
pharmaceuticals, as well as toxicological testing of industrial and naturally-occurring environmental 
compounds. All compounds induce characteristic gene expression patterns, frequently termed 
molecular fingerprints or toxicant signatures, which are indicative of mechanisms of action and toxicity 

15 (Nuwaysir, E.F. et al. (1999) Mol. Carcinog. 24:153-159; Steiner, S. and NX. Anderson (2000) 

Toxicol Lett. 112-113:467-471, expressly incorporated by reference herein). If a test compound has a 
signature similar to that of a compound with known toxicity, it is likely to share those toxic properties. 
These fingerprints or signatures are most useful and refined when they contain expression information 
from a large number of genes and gene families. Ideally, a genome- wide measurement of expression 

20 provides the highest quality signature. Even genes whose expression is not altered by any tested 
compounds are important as well, as the levels of expression of these genes are used to normalize the 
rest of the expression data. The normalization procedure is useful for comparison of expression data 
afro- treatment with different compounds. While the assignment of gene function to elements of a 
toxicant signature aids in interpretation of toxicity mechanisms, knowledge of gene function is not 

25 . necessary for the statistical nratt^ng of signature (See,for - 

example, Press Release 00-02 from the National Institute of Environmoital Health Sciences, released 
February 29, 2000, available at httpy/ww.nie^.nikgov/oc/news/toxdiip.htm.) Therefore, it is 
important and desirable in toxicological screening using toxicant signatures to include all expressed 
gene sequences. 

30 In one embodiment, the toxicity of a test compound is assessed by treating a biological sample 

containing nucleic acids with the test compound. Nucleic acids that are expressed in the treated 
biological sample are hybridized with one or more probes specific to the polynucleotides of the 
present invention, so that transcript levels corresponding to the polynucleotides of the present 
invention may be quantified. The transcript levels in the treated biological sample are compared with 

35 levels in an untreated biological sample. Differences in the transcript levels between the two samples 
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are indicative of a toxic response caused by the test compound in the treated sample. 

Another particular embodiment relates to the use of the polypeptide sequences of the present 
invention to analyze the proteome of a tissue or cell type. The term proteome refers to the global 
pattern of protein expression in a particular tissue or cell type. Each protein component of a proteome 

5 can be subjected individually to further analysis. Proteome expression pattens, or profiles, are 
analyzed by quantifying the number of expressed proteins and their relative abundance undo- given 
conditions and at a given time. A profile of a cell's proteome may thus be generated by separating and 
analyzing the polypeptides of a particular tissue or cell type. In one embodiment, the separation is 
achieved using two-dimensional gel electrophoresis, in which proteins from a sample are separated by 

10 isoelectric focusing in the first dimension, and then according to molecular weight by sodium dodecyl 
sulfate slab gel electrophoresis in the second dimension (Steiner and Anderson, supra). The proteins are 
visualized in the gd as discrete and uniquely positioned spots, typically by staining the gel with an agent 
such as Coomassie Blue or silver or fluorescent stains. The optical density of each protein spot is 
generally proportional to the level of the protein in the sample. The optical densities of equivalently 

15 positioned protein spots from different samples, for example, from biological samples either treated or 
untreated with a test compound or therapeutic agent, are compared to identify any changes in protein 
spot density related to the treatment The proteins in the spots are partially sequenced using, for 
example, standard methods employing chemical or enzymatic cleavage followed by mass spectrometry. 
The identity of the protein in a spot may be determined by comparing its partial sequence, preferably of 

20 at least 5 contiguous amino acid residues, to the polypeptide sequences of the present inventioa In 
some cases, further sequence data may be obtained for definitive protein identificatioa 

A proteomic profile may also be generated using antibodies specific for HTFS to quantify the 
levels of HTFS expressioa In one embodiment, the antibodies are used as dements on a microarray, 
and protdn expression levds are quantified by exposing the microarray to the sample and detecting the 

25 levds of protdn bound to each airay demetf (Lucking, A ^ 

Mendoze, L.G. et al. (1999) Biotechniques 27:778-788). Detection may be performed by a variety of 
methods known in the art, for example, by reacting the protdns in the sample with a thiol- or amino- 
reactive fluorescent compound and detecting the amount of fluorescence bound at each array dement. 

Toxicant signatures at the proteome level are also useful for toxicological screening, and should 

30 be analyzed in paralld with toxicant signatures at the transcript levd. There is a poor corrdation 
between transcript and protein abundances for some proteins in some tissues (Anderson, N.L. and J. 
Sdlhamer (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be useful in the 
analysis of compounds which do not significantly affect the transcript image, but which alter the 
proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, due to rapid 
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degradation of mRNA, so proteomic profiling may be more reliable and informative in such cases. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins that are expressed in the treated biological 
- sample are separated so that the amount of each protein can be quantified. The amount of each protein 
5 is compared to the amount of the corresponding protein in an untreated biological sample. A difference 
in the amount of protein between the two samples is indicative of a toxic response to the test compound 
in the treated sample Individual proteins are identified by sequencing the amino acid residues of the 
individual prolans and comparing these partial sequences to the polypeptides of the present invention. 
In another embodiment, the toxicity of a test compound is assessed by treating a biological 

10 sample containing proteins with the test compound. Proteins from the biological sample are incubated 
with antibodies specific to the polypeptides of the present inventioa The amount of protein recognized 
by the antibodies is quantified. The amount of protein in the treated biological sample is compared with 
the amount in an untreated biological sample A difference in the amount of protein between the two 
samples is indicative of a toxic response to the test compound in the treated sample. 

15 Microarrays may be prepared, used, and analyzed using methods known in the art. (See, e.g., 

Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad Sci. 
USA 93:10614-10619; Baldeschwdler et al. (1995) PCT application W095/251 116; Shalon, D. et al. 
(1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. USA 94:2150- 
2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) Various types of micToarxays are well 

20 known and thoroughly described in DNA Microarravs: A Practical Approach . M Schena, ed. (1999) 
Oxford University Press, London, hereby expressly incorporated by reference. 

In another embodiment of the invention, nucleic acid sequences encoding HTFS may be used to 
generate hybridization probes useful in mapping the naturally occurring genomic sequence. Either 
coding or noncoding sequences may be used, and in some instances, noncoding sequences may be 

25. presferableovoicoding sequences., For example, conservation of a coding sequence among members 
of a multi-gene family may potentially cause undesired cross hybridization during chromosomal 
mapping. The sequences may be mapped to a particular chromosome, to a specific region of a 
chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes (HACs), 
yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial PI 

30 constructions, or single chromosome cDNA libraries. (See, eg., Harrington, J.J. et al. (1997) Nat. 
Genet 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B J. (1991) Trends Genet 
7:149-154.) Once mapped, the nucleic acid sequences of the invention may be used to develop genetic 
linkage maps, for example, which correlate the inheritance of a disease state with the inheritance of a 
particular chromosome region or restriction fragment length polymorphism (RFLP). (See, e.g., 

35 Lander, E.S. and D. Botstein (1986) Proc. Natl. Acad. Sci. USA 83:7353-7357.) 
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Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic map 
data, (See, e.g., Heinz-Ulrich, et al. (1995) in Meyers, supra , pp. 965-968.) Examples of genetic map 
data can be found in various scientific journals or at the Online Mendelian Inheritance in Man (OMIM) 
World Wide Web site. Correlation between the location of the gene encoding HTFS on a physical map 

5 and a specific disorder, or a predisposition to a specific disorder, may help define the region of DNA 
associated with that disorder and thus may further positional cloning efforts. 

In situ hybridization of chromosomal preparations and physical mapping techniques, such as 
linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, may 

10 reveal associated markers even if the exact chromosomal locus is not known. This information is 
valuable to investigators searching for disease genes using positional cloning or other gene discovery 
techniques. Once the gene or genes responsible for a disease or syndrome have been crudely localized 
by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 1 lq22-23, any sequences 
mapping to that area may represent associated or regulatory genes for further investigation. (See, e.g„ 

15 Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide sequence of the instant invention may 
also be used to detect differences in the chromosomal location due to translocation, inversion, etc. , 
among normal, carrier, or affected individuals. 

In another embodiment of the invention, HTFS, its catalytic or immunogenic fragments, or 
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 

20 screening techniques. The fragment employed in such screening may be free in solution, affixed to a 
solid support, borne on a cell surface, or located intracellularly. The formation of binding complexes 
between HTFS and the agent being tested may be measured 

Another technique for drug screening provides for high throughput screening of compounds 
having suitable binding affinity to the protein of interest. (See, e.g., Gcysen, et al. (1984) PCT 

25 application WO84/03564.) In this ►method, large numbers of (UffCTem sman jest compounds are. - — 

synthesized on a solid substrate. The test compounds are reacted with HTFS, or fragments thereof, and 
washed Bound HTFS is then detected by methods well known in the art Purified HTFS can also be 
coated directly onto plates for use in the aforementioned drug screening techniques. Alternatively, 
non-neutralizing antibodies can be used to capture the peptide and immobilize it on a solid support. 

30 In another embodiment, one may use competitive drug screening assays in which neutralizing 

antibodies capable of binding HTFS specifically compete with a test compound for binding HTFS. In 
this manner, antibodies can be used to detect the presence of any peptide which shares one or more 
antigenic determinants with HTFS. 

In additional embodiments, the nucleotide sequences which encode HTFS may be used in any 
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molecular biology techniques that have yet to be developed, provided the new techniques rely on 
properties of nucleotide sequences that are currently known, including, but not limited to, such 
properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the preceding 
5 description, utilize the present invention to its fullest extent Hie following preferred specific 

embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder 
of the disclosure in any way whatsoever. 

The disclosures of all patents, applications and publications, mentioned above and below, in 
particular U.S. Serial No. 60/163,595, are hereby expressly incorporated by reference. 

10 

EXAMPLES 

I. Construction of cDNA Libraries 

RNA was purchased from Clontech or isolated from tissueis described in Table 4. Some tissues 
were homogenized and lysed in guanidinium isothiocyanate, while others were homogenized and lysed 

15 in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life Technologies), a monpphasic 
solution of phenol and guanidine isothiocyanate. The resulting lysates were centrifuged over CsCl 
cushions or extracted with chloroform. RNA was precipitated from the lysates with either isopropanol 
or sodium acetate and ethanol, or by other routine methods. 

Phenol attraction and precipitation of RNA were repeated as necessary to increase RNA 

20 purity. In some cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was isolated 
using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN, 
Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was 
isolated directly from tissue lysates using other RNA isolation kits, e.g. , the POLY(A)PURE mRNA 
purification kit (Ambion, Austin TX). 

25 - - In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA - 

libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP 
vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the 
recommended procedures or similar methods known in the art (See, e.g., Ausubel, 1997, supra, units 
5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic 

30 oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with the 
appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300-1000 
bp) using SEPHACRYL S1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column 
chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs 
were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g., 

35 PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), pcDNAl 1 plasmid 
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(Invitrogen, Carlsbad CA), or pINCY plasmid (Incyte Genomics, Palo AltoCA). Recombinant 
plasmids were transformed into competent Rcoli cells including XLl-Blue, XLl-BlueMRF, or SOLR 
from Stratagene or DHSa, DH10B, or ElectroMAX DH10B from Life Technologies. 

II. Isolation of cDNA Clones 

5 Plasmids obtained as described in Example I were recovered from host cells by in vivo excision 

using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified using at least 
one of the tbllowing: a Magic or WIZARD Minipreps DNA puriOcation system (Promega); an AGTC 
Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, QIAWELL 
8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the R.E. A.L. PREP 96 plasmid 

10 purification kit from QIAGEN. Following precipitation, plasmids wereresuspended in 0.1 nil of 
distilled water and stored, with or without lyophilization, at 4°C 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a 
high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 384- 

15 well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically using 
PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence scanner 
(Labsystems Oy, Helsinki, Finland). 

III. Sequencing and Analysis 

Incyte cDNA recovered in plasmids as described in Example II were sequenced as follows. 

20 Sequencing reactions were processed using standard methods or high-throughput instrumentation 
such as the ABI CATALYST 800 (PE Biosystems) thermal cycler or the PTC-200 thermal cycler (MJ 
Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the MICROLAB 
2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared using reagents 
provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such as the ABI 

25- PRISM BIGDYE Terminator, cycle sequencing ready reaction kit (PE Biosystems). Electrophoretic, 
separation of cDN A sequencing reactions and detection of labeled polynucleotides were carried out 
using the MEGABACE 1000 DNA sequencing system (Molecular Dynamics); the ABI PRISM 373 or 
377 sequencing system (PE Biosystems) in conjunction with standard ABI protocols and base calling 
software; or other sequence analysis systems known in the art Reading frames within the cDN A 

30 sequences were identified using standard methods (reviewed in Ausubd, 1997, supra, unit 7.7). Some 
of the cDNA sequences were selected for extension using the techniques disclosed in Example VL 

The polynucleotide sequences derived from cDNA sequencing were assembled and analyzed 
using a combination of software programs which utilize algorithms well known to those skilled in the 
art. Table 5 summarizes the tools, programs, and algorithms used and provides applicable descriptions, 
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references, and threshold parameters. The first column of Table 5 shows the tools, programs, and 
algorithms used, the second column provides brief descriptions thereof, the third column presents 
appropriate references, all of which are incorporated by reference herein in their entirety, and the fourth 
column presents, where applicable, the scores, probability values, and other parameters used to evaluate 

5 the strength of a match between two sequences (the higher the score, the greater the homology between 
two sequences). Sequences were analyzed using MACDNASIS PRO software (Hitachi Software 
Engineering, South San Francisco CA) and LASERGENE software (DNASTAR). Polynucleotide and 
polypeptide sequence alignments were generated using the default parameters specified by the clustal 
algorithm as incorporated into the MEGALIGN multisequence alignment program (DNASTAR), which 

10 also calculates the percent identity between aligned sequences. 

The polynucleotide sequences were validated by removing vector, linker, and polyA sequences 
and by masking ambiguous bases, using algorithms and programs based on BLAST, dynamic 
programing, and dinucleotide nearest neighbor analysis. The sequences were then queried against a 
selection of public databases such as the GenBank primate, rodent, mammalian, vertebrate, and 

15 eukaryote databases, and BLOCKS, PRINTS, DOMO, PRODOM, and PFAM to acquire annotation 
using programs based on BLAST, FASTA, and BLIMPS. The sequences were assembled into full 
length polynucleotide sequences using programs based on Phred, Phrap, and Consed, and were screened 
for open reading frames using programs based on GeneMark, BLAST, and FASTA The full length 
polynucleotide sequences were translated to derive the corresponding full length amino acid sequences, 

20 and these full length sequences were subsequently analyzed by querying against databases such as the 
GenBank databases (described above), SwissProt, BLOCKS, PRINTS, DOMO, PRODOM, Prosite, 
and Hidden Markov Model (HMM)-based protein family databases such as PFAM. HMM is a 
probabilistic approach which analyzes consensus primary structures of gene families. (See, eg., . 
Eddy, S.R. (1996) Curr. Opin. Struct. Biol. 6:361-365.) 

25. .. -The programs described above for the assembly and analysis of full length polynucleotide and - 

amino acid sequences were also used to identify polynucleotide sequence fragments from SEQ ID 
NO:43-84. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and 
amplification technologies were described in The Invention section above. 
IV. Analysis of Polynucleotide Expression 

30 Northern analysis is a laboratory technique used to detect the presence of a transcript of a gene 

and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs from a 
particular cell type or tissue have been bound. (See, e.g., Sambrook, supra , ch. 7; Ausubel, 1995, 
supra, ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or related 
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molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is 
much faster than multiple membran&based hybridizations. In addition, the sensitivity of the compute 
search can be modified to determine whether any particular match is categorized as exact or similar. 
The basis of the search is the product score, which is defined as: 
5 BLAST Score x Percent Identity 

5 x minimum {length(Seq. 1), length(Seq. 2)} 

The product score takes into account both the degree of similarity between two sequences and the length 
of the sequence match. The product score is a normalized value between 0 and 1 00, and is calculated 

10 as follows: the BLAST score is multiplied by the percent nucleotide identity and the product is divided 
. by (5 times the length of the shorter of the two sequences). The BLAST score is calculated by 
assigning a score of +5 for every base that matches in a high-scoring segment pair (HSP), and -4 for 
every mismatch. Two sequences may share more than one HSP (separated by gaps). If there is more 
than one HSP, then the pair with the highest BLAST score is used to calculate the product score. The 

15 product score represents a balance between fractional overlap and quality in a BLAST alignment. For 
example, a product score of 100 is produced only for 100% identity over the entire length of the shorter 
of the two sequences being compared. A product score of 70 is produced either by 100% identity and 
70% overlap at one end, or by 88% identity and 1 00% overlap at the other. A product score of 50 is 
produced either by 100% identity and 50% overlap at one end, or 79% identity and 100% overlap. 

20 The results of northern analyses are reported as a percentage distribution of libraries in which 

the transcript encoding HTFS occurred. Analysis involved the categorization ofcDNA libraries by 
organ/tissue and disease. The organ/tissue categories included cardiovascular, dermatologic, 
developmental, endocrine, gastrointestinal, hematopoietic/immune, musculoskeletal, nervous, 
reproductive, and urologic. The disease/condition categories included cancer, inflammation, trauma, 
_25 celLproliferatioru-neurological,.and pooled. For each category, the number of libraries expressing the 
sequence of interest was counted and divided by the total number of libraries across all categories. 
Percentage values of tissue-specific and disease- or condition-specific expression are reported in Table 
3. ^ ' • 

V. Chromosomal Mapping of HTFS Encoding Polynucleotides 

30 The cDNA sequences which were used to assemble SEQ ID NO:43-84 were compared with 

sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other 
implementations of the Smith- Waterman algorithm. Sequences from these databases that matched 
SEQ ID NO:43-84 were assembled into clusters of contiguous and overlapping sequences using 
assembly algorithms such as Phrap (Table 5). Radiation hybrid and genetic mapping data available 

35 from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for 
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Genome Research (WIGR), and G6n6thon were used to determine if any of the clustered sequences 
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment 
of all sequences of that cluster, including its particular SEQ ID NO;, to that map location. 

The genetic map locations of SEQ ID NO:44, 46, 48, 49, 52, 59, 60, 62, 68, 78, 85, and 86 

5 are described in The Invention as ranges, or intervals, of human chromosomes. More than one map 
location is reported for SEQ ID NO:59, 78, 85, and 86, indicating that previously mapped sequences 
having similarity, but not complete identity, to SEQ ID NO:59, 78, 85, and 86 were assembled into 
their respective clusters. Hie map position of an interval, in centiMorgans, is measured relative to the 
terminus of the chromosome's p-arm. (The centiMorgan (cM) is a unit of measurement based on 

10 recombination frequencies between chromosomal markers. On average, 1 cM is roughly equivalent 
to 1 megabase (Mb) of DNA in humans, although this can vary widely due to hot and cold spots of 
recombination.) The cM distances are based on genetic markers mapped by G6n6thon which provide 
boundaries for radiation hybrid markers whose sequences were included in each of the clusters. 
Human genome maps and other resources available to the public, such as the NCBI M GeneMap'99" 

15 World Wide Web site (http://www.ncbi.nlm.nih.gov/gencmap/) , can be employed to determine if 
previously identified disease genes map within or in proximity to the intervals indicated above. 
VI. Extension of HTFS Encoding Polynucleotides 

The full length nucleic acid sequences of SEQ ID NO:43-84 were produced by extension of an 
appropriate fragment of the full length molecule using oligonucleotide primers designed from this 

20 fragment. One primer was synthesized to initiate 5 ' extension of the known fragment, and the other 
primer, to initiate 3* extension of the known fragment. The initial primers were designed using OLIGO 
4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 nucleotides 
in length, to have a GC content of about 50% or more, and to anneal to the target sequence at 
temperatures of about 68°C to about 72°C. Any stretch of nucleotides which would result in hairpin 

25 structures and primer-primer dimerizations was avoided. 

Selected human cDN A libraries were used to extend the sequence. If more than one extension 
was necessary or desired, additional or nested sets of primers were designed 

High fidelity amplification was obtained by PCR using methods well known in the art. PCR 
was performed in 96-weJl plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction 

30 mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg 2 *, (NH 4 ) 2 S0 4 , 
and p-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme 
(Life Technologies), and Pfii DNA polymerase (Stratagene), with the following parameters for primer 
pair PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68°C, 
2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. In the 

35 alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1 : 94°C, 3 min; Step 2: 
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94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 limes; 
Step 6: 68°C, 5 min; Step 7: storage at 4°C. 

The concentration of DNA in each well was determined by dispensing 100 \i\ PICOGREEN 
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE 
5 and 0.5 \xl of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar, 
Acton MA), allowing the DNA to bind to the reagent The plate was scanned in a Fluoroskan II 
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the 
concentration of DNA A 5 //I to 10 (A aliquot of the reaction mixture was analyzed by electrophoresis 
on a 1 % agarose mini-gel to determine which reactions were successful in extending the sequence. 

10 The extended nucleotides were desalted and concentrated, transferred to 384-weIl plates, 

digested with CvUI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religation into pUC 1 8 vector (Amersham Pharmacia Biotech). For 
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0:8%) agarose 
gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended clones were 

15 religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham 

Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site overhangs, 
and Iransfected into competent E. coli cells. Transformed cells were selected on antibiotic-containing 
media, and individual colonies were picked and cultured overnight at 37 °C in 384-well plates in LB/2x 
carb liquid media. 

20 The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase (Amersham 

Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following parameters: Step 1: 
94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72*C, 2 min; Step 5: steps 2, 3, and 4 
repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was quantified by PICOGREEN 
reagent (Molecular Probes) as described above. Samples with low DNA recoveries were reamplified 

25_ using the same conditions as.described above. Samples were diluted with 20% dimethysulfoxide (1:2, 
v/v), and sequenced using DYENAMIC energy transfer sequencing primers and the DYENAMIC 
DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM BIGDYE Terminator cycle 
sequencing ready reaction kit (PE Biosystems). 

In like manna-, the polynucleotide sequences of SEQ ID NO:43-84 are used to obtain 5* 

30 regulatory sequences using the procedure above, along with oligonucleotides designed for such 
extension, and an appropriate genomic library. 
VII. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO:43-84 are employed to screen cDNAs, genomic 
DNAs. or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base pairs, is 
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specifically described, essentially the same procedure is used with largo* nucleotide fragments. 
Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 software (National 
Biosciences) and labeled by combining 50 pmol of each oligomer, 250 /*Ci of [y- 32 P] adenosine 
triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase (DuPont NEN, Boston 
5 MA). The labeled oligonucleotides are substantially purified using a SEPHADEX G-25 superfine size 
exclusion dextran bead column (Amersham Pharmacia Biotech). An aliquot containing 10 7 counts per 
minute of the labeled probe is used in a typical membrane-based hybridization analysis of human 
genomic DNA digested with one of the following endonucleases: Ase I, Bgl II, Eco RI, Pst I, Xba I, or 
Pvu II (DuPont NEN). 

10 The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 

membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 
hours at 40 °C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 
Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
15 compared. 

VIII. Microarrays 

The linkage or synthesis of array elements upon a microarray can be achieved utilizing 
photolithography, piezoelectric printing (ink-jet printing, See, e.g., Baldeschweiler, supra), mechanical 
microspotting technologies, and derivatives thereof. The substrate in each of the aforementioned 
20 technologies should be uniform and solid with a non-porous surface (Sch^na (1999), supra). Suggested 
substrates include silicon, silica, glass slides, glass chips, and silicon wafers. Alternatively, a procedure 
analogous to a dot or slot blot may also be used to arrange and link elements to the surface of a 
substrate using thermal, UV, chemical, or mechanical bonding procedures. A typical array may be 
produced using available methods and machines well known to those of ordinary skill in the art and may 
25 ...contain any appropriate number.of elements. (See, eg., Schena, M et al. (1995) Science 270:467-470; - 
Shalon, D. et al. (1996) Genome Res. 6:639-645; Marshall, A. and J. Hodgson (1998) Nat Biotechnol. 
16:27-31.) 

Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof may 
comprise the dements of the microarray. Fragments or oligomers suitable for hybridization can be 
30 selected using software well known in the art such as LASERGENE software (DNASTAR). The array 
dements are hybridized with polynucleotides in a biological sample. The polynucleotides in the 
biological sample are conjugated to a fluorescent labd or other molecular tag for ease of detection. 
After hybridization, norihybridized nucleotides from the biological sample are removed, and a 
fluorescence scanner is used to detect hybridization at each array dement. Alternativdy, laser 
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desorbtion and mass spectrometry may be used for detection of hybridizatioa The degree of 
complementarity and the relative abundance of each polynucleotide which hybridizes to an element on 
the microarray may be assessed. In one embodiment, microarray preparation and usage is described in 
detail below. 
5 Tissue or Cell Sample Preparation 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 
poly(A) + RNA is purified using the oligo-(dT) cellulose method Each poly(A)* RNA sample is 
reverse transcribed using MMLV reverse-transcriptase, 0.05 pg/pl oligo-(dT) primer (21mer), IX first 
strand buffer, 0.03 units/Ml RNase inhibitor, 500 pM dATP, 500 \M dGTP, 500 pM dTTP, 40 mM 

10 dCTP, 40 fiM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). Hie reverse 
transcription reaction is performed in a 25 ml volume containing 200 ng poly(A)* RNA with 
GEMBR1GHT kits (Incyte). Specific control poly(A)* RNAs are synthesized by in vitro transcription 
from non-coding yeast genomic DNA. After incubation at 37 °C for 2 hr, each reaction sample (one 
with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and 

15 incubated for 20 minutes at 85 °C to the stop the reaction and degrade the RNA Samples are purified 
using two successive CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc. 
(CLONTECH), Palo Alto C A) and after combining, both reaction samples are ethanol precipitated 
using 1 ml of glycogen (1 mg/ml), 60 nil sodium acetate, and 300 ml of 100% ethanol. Hie sample is 
then dried to completion using a SpeedVAC (Savant Instruments Inc., Holhrook NY) and 

20 resuspendedinl4pl5XSSC/0.2%SDS. 
Microarray Preparation 

Sequences of the present invention are used to generate array elements. Each array element is 
amplified from bacterial cells containing vectors with cloned cDN A inserts. PCR amplification uses J 
primers complementary to the vector sequences flanking the cDNA insert Array elements are 

25 amplified in thirty cycles of PCR from ^ initial quantity of 1-2 ng to a fin^_ quantity graterjhan 5. _ ... 

pg. Amplified array elements are then purified using SEPH ACR YL-400 (Amersham Pharmacia 
Biotech). 

Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 
slides (Coming) are cleaned by ultrasound in 0. 1 % SDS and acetone, with extensive distilled water 
30 washes between and after 1 treatments. Glass slides are etched in 4% hydrofluoric acid (VWR 

Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water, and 
coated with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 1 10°C 
oven. 

Array elements are applied to the coated glass substrate using a procedure described in US 
35 Patent No. 5,807,522, incorporated herein by reference. 1 pi of the array element DNA, at an average 
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concentration of 100 ng/pl, is loaded into the open capillary printing element by a high-speed robotic 
apparatus. The apparatus then deposits about 5 nl of array element sample per slide. s 

Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene). 
Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
5 Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60 °C followed by washes in 
0.2% SDS and distilled water as before. 
Hybridization 

Hybridization reactions contain 9 pi of sample mixture consisting of 0.2 jig each of Cy3 and 
10 Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample 

mixture is heated to 65 °C for 5 minutes and is aliquoted onto the microaiTay surface and covered with 
an 1 .8 cm 2 coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly 
larger than a microscope slide. The chamber is kept at 100% humidity internally by the addition of 
140 pi of 5X SSC in a corner of the chamber. The chamber containing the arrays is incubated for 
15 about 6.5 hours at 60 °C The arrays are washed for 10 min at 45 °C in a first wash buffer (IX SSC, 
0.1% SDS), three times for 10 minutes each at 45 °G in a second wash buffer (0.1X SSC), and dried. 
Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines 

20 at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. Hie excitation laser light is 
focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide 
containing the array is placed on a computer-controlled X- Y stage on the microscope and raster- 
scanned past the objective. The 1.8 cm x 1.8 cm array used in the present example is scanned with a 
resolution of 20 micrometers. 

25 In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 

Emitted light is split* based on wavelength, into two photomultiplier tube detectors (PMT R 1477, 
Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. Appropriate 
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The 
emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is 

30 typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source, 
although the apparatus is capable of recording the spectra from both fluorophores simultaneously. 

The sensitivity of the scans is typically calibrated using the signal intensity generated by a 
cDNA control species added to the sample mixture at a known concentration. A specific location on 
the arcay contains a complementary DNA sequence, allowing the intensity of the signal at that 

35 location to be correlated with a weight ratio of hybridizing species of 1 : 100,000. When two samples 
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from different sources (e.g., representing test and control cells), each labeled with a different 
fluorophore, are hybridized to a single array for the purpose of identifying genes that are differentially 
expressed, the calibration is done by labeling samples of the calibrating cDNA with the two 
fluorophores and adding identical amounts of each to the hybridization mixture. 
5 Hie output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital 

(A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC 
compute-. The digitized data are displayed as an image where the signal intensity is mapped using a 
linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high 
signal). The data is also analyzed quantitatively. Where two different fluorophores are excited and 

10 measured simultaneously, the data are first connected for optical crosstalk (due to overlapping 
emission spectra) between the fluorophores using each fluorophore' s emission spectrum. 

A grid is superimposed over the fluorescence signal image such that the signal from each spot 
is centered in each element of the grid. The fluorescence signal within each element is then integrated 
to obtain a numerical value corresponding to the average intensity of the signal. The software used 

15 for signal analysis is the GEMTOOLS gene expression analysis program (Incyte). 
IX. Complementary Polynucleotides 

Sequences complementary to the HTFS-encoding sequences, or any parts thereof, are used to 
detect, decrease, or inhibit expression of naturally occurring HTFS. Although use of oligonucleotides 
comprising from about 1 5 to 30 base pairs is described, essentially the same procedure is used with 

20 smaller or with larger sequence fragments. Appropriate oligonucleotides are designed using OLIGO 
4.06 software (National Biosciences) and the coding sequence of HTFS. To inhibit transcription, a 
complementary oligonucleotide is designed from the most unique 5' sequence and used to prevent 
promoter binding to the coding sequence. To inhibit translation, a complementary oligonucleotide is 
designed to prevent ribosomal binding to the HTFS-encoding transcript 

25 X. Expression of HTFS . 

Expression and purification of HTFS is achieved using bacterial or virus-based expression 
systems. For expression of HTFS in bacteria, cDNA is subclonedinto an appropriate vector containing 
an antibiotic resistance gene and an inducible promoter that directs high levels of cDN A transcriptioa 
Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid promoter and the 

30 T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory element 

Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3), Antibiotic 
resistant bacteria express HTFS upon induction with isopropyl beta-D-thiogalactopyranoside (IPTG). 
Expression of HTFS in eukaryotic cells is achieved by infecting insect or mammalian cell lines with 
recombinant Autographica califomica nuclear polyhedrosis virus (AcMNPV), commonly known as 

35 baculovirus. The nonessential polyhedrin gene of baculovirus is replaced with cDN A encoding HTFS 
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by either homologous recombination or bacterial-mediated transposition involving transfer plasmid 
intermediates. Viral infectivity is maintained and the strong polyhedrin promoter drives high levels of 
cDNA transcription. Recombinant baculovirus is used to infect Spodoptera frueiperda (Sf9) insect 
cells in most cases, or human hepatocytes, in some cases. Infection of the latter requires additional 
5 genetic modifications to baculovirus. (See Engelhard, E.K. et al. (1994) Proc. Natl. Acad Sci. USA 
91 :3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945.) 

In most expression systems, HTFS is synthesized as a fusion protein with, e.g., glutathione S- 
transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 
affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26-kilodalton 

10 enzyme from Schistosoma japonicum , enables the purification of fusion proteins on immobilized 
glutathione under conditions that maintain protein activity and antigenicity (Amersham Pharmacia 
Biotech). Following purification, the GST moiety can be proteolytically cleaved from HTFS at 
specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity purification 
using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak). 6- 

15 His, a stretch of six consecutive histidine residues, enables purification on metal-chelate resins 

(QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, supra , 
ch. 10 and 16). Purified HTFS obtained by these methods can be used directly in the assays shown in 
Examples XI and XV. 
XI. Demonstration of HTFS Activity 

20 Galactosyltransferase activity is determined by measuring the transfer of galactose from UDP- 

galactose to a GlcNAc-terminated oligosaccharide chain in a radioactive assay. (Kolbinger, F. et al. 
(1998) J. Biol. Chem. 273:58-65.) The HTFS sample is incubated with 14 \i\ of assay stock solution 
(180 mM sodium cacodylate, pH 6.5, 1 mg/ml bovine serum albumin, 0.26 mM UDP-galactose, 2 \i\ of 
UDP-[ s H]galactose), 1 \i\ of MnCl 2 (500 mM), and 2.5 \i\ of GlcNAcpO-(CHJ r C0 2 Me (37 mg/ml in 

25 dimethyl sulfoxide) for 60 minutes at37-°Gr The reaction is quenched by the addition of 1 ml of water 
and loaded on a CI 8 Sep-Pak cartridge (Waters), and the column is washed twice with 5 ml of water to 
remove unreacted UDP-[>H]galactose. The [ 3 H]galactosylated GlcNAcpO^CHjVCO^e remains 
bound to the column during the water washes and is eluted with 5 ml of methanol. Radioactivity in the 
eluted material is measured by liquid scintillation counting and is proportional to galactosyltransferase 

30 activity in the starting sample. 

Alternatively, mcthyltransferase activity is determined using a method that measures transfer of 
radiolabeled methyl groups from a donor substrate to an acceptor substrate (Bokar, J. A et al. (supra)). 
Reaction mixtures (50 \i\ final volume) contain 15 mM HEPES, pH 7.9, 1.5 mM MgCl 2 , 10 mM 
dithiothrdtol, 3% polyviuylalcohol, 1.5 pCi [me/Ay/- 3 H]AdoMet (0.375 \iM AdoMet) (DuPont-NEN), 
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0.6 fig HTFS, and acceptor substrate (0.4 \ig [ 35 S]RNA or 6-mercaptopurine (6-MP) to 1 mM final 
concentration). Reaction mixtures are incubated at 30°C for 30 minutes, thai 65 °C for 5 minutes. 

Analysis of [methyl- 3 H]KNA is as follows: 1) 50 pi of 2 x loading buffer (20 mM tris-HCl, pH 
7.6, 1 M LiCl, 1 mM EDTA, 1% sodium dodecyl sulphate (SDS)) and 50 \il oligo d(T)-cellulose (10 

5 mg/ml in 1 x loading buffer) are added to the reaction mixture, and incubated at ambient temperature 
with shaking for 30 minutes. 2) Reaction mixtures are transferred to a 96-well filtration plate attached 
to a vacuum apparatus. 3) Each sample is washed sequentially with three 2.4 ml aliquots of 1 x oligo 
d(T) loading buffer containing 0.5% SDS, 0.1 % SDS, or no SDS. and 4) RNA is eluted with 300 Ml of 
water into a 96-well collection plate, transferred to scintillation vials containing liquid scintillant, and 

10 radioactivity determined 

Analysis of [methyl- 3 H]6-MP is as follows: 1) 500 \il 0.5 M borate buffer, pH 10.0, and then 
2.5 ml of 20% (v/v) isoamyl alcohol in toluene are added to the reaction mixtures. 2) The samples 
mixed by vigorous vortexing for ten seconds. 3) After centrifugation at 700g for 10 minutes, 1 .5 ml of 
the organic phase is transferred to scintillation vials containing 0.5 ml absolute ethanol and liquid 

1 5 scintillant, and radioactivity determined, and 4) Results are corrected for the extraction of 6-MP into 
the organic phase (approximately 41 %). 
XII. Functional Assays 

HTFS function is assessed by expressing the sequences encoding HTFS at physiologically 
elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression 

20 vector containing a strong promoter that drives high levels of cDNA expression. Vectors of choice 
include pCMV SPORT plasmid (Life Technologies) and pCR3. 1 plasmid (Invitrogen), both of which 
contain the cytomegalovirus promoter. 5-10 //g of recombinant vector are transiently transfected into a 
human cell line, for example, an endothelial or hematopoietic cell line, using either liposome 
formulations or electroporatioa 1-2 //g of an additional plasmid containing sequences encoding a 

25 — marberprotein are co-^^ 

transfected cells from nontransfected cells and is a reliable predictor of cDNA expression from the 
recombinant vector. Marker proteins of choice include, e.g., Green Fluorescent Protein (GFP; 
Clontech), CD64, or a CD64-GFP fusion proteia Row cytometry (FCM), an automated, laser optics- 
based technique, is used to identify transfected cells expressing GFP or CD64-GFP and to evaluate the 

30 apoptotic state of the cells and other cellular properties. FCM detects and quantifies the uptake of 

fluorescent molecules that diagnose events preceding or coincident with cell death. These events include 
changes in nuclear DNA content as measured by staining of DNA with propidium iodide; changes in 
cell size and granularity as measured by forward light scatter and 90 degree side light scatter; down- 
regulation of DNA synthesis as measured by decrease in bromodeoxyuridine uptake; alterations in 
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expression of cell surface and intracellular proteins as measured by reactivity with specific antibodies; 
and alterations in plasma membrane composition as measured by the binding of fluorescdn-conjugated 
Annexin V protein to the cell surface. Methods in flow cytometry are discussed in Ormerod, M.G. 
(1994) Flow Cytometry. Oxford, New York NY. 
5 The influence of HTFS on gene expression can be assessed using highly purified populations of 

cells transfected with sequences encoding HTFS and either CD64 or CD64-GFP. CD64 and CD64- 
GFP are expressed on the surface of transfected cells and bind to conserved regions of human 
immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells using 
magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Late Success NY). 
10 mRNA can be purified from the cells using methods well known by those of skill in the art Expression 
of mRNA encoding HTFS and other genes of interest can be analyzed by northern analysis or 
microarray techniques. 

XIIL Production of HTFS Specific Antibodies 

HTFS substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g„ 

15 Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. 

Alternatively, the HTFS amino acid sequence is analyzed using LASERGENE software 
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for 

20 selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well 
described in the art. (See, &g., Ausubel, 1995, supra, ch. 11.) 

Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 43 1 A 
peptide synthesizer (PE Biosystems) using FMOC chemistry and coupled to KLH (Sigma-Aldrich, St 
Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to increase 

.25 immunogenicity^ (See. e.g.» Ausubel. ,1995. supra .). Rabbits are immunized with the oligopeptidedCLH _ 
complex in complete Freund's adjuvant Resulting antisera are tested for antipeptide and anti-HTFS 
activity by, for example, binding the peptide or HTFS to a substrate, blocking with 1 % BSA, reacting 
with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG. 
XIV. Purification of Naturally Occurring HTFS Using Specific Antibodies 

30 Naturally occurring or recombinant HTFS is substantially purified by immunoaffinity 

chromatography using antibodies specific for HTFS. An immunoaffinity column is constructed by 
covalently coupling anti-HTFS antibody to an activated chromatographic resin, such as CNBr-activated 
SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is blocked and washed 
according to the manufacturer's instructions. 
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Media containing HTFS are passed over the immunoaffinity column, and the column is washed 
under conditions that allow the preferential absorbance of HTFS (e.g., high ionic strength buffers in the 
presence of detergent). The column is eluted under conditions that disrupt antibody/HTFS binding 
(e.g., a buffo- of pH 2 to pH 3, or a high concentration of a chaotrope, such as urea or thiocyanate ion), 
5 and HTFS is collected 

XV* Identification of Molecules Which Interact with HTFS 

HTFS, or biologically active fragments thereof , are labeled with 125 I Bolton-Hunter reagent 
(See, e.g., Bolton A.E. and W.M Hunter (1973) Biochem. J. 133:529-539.) Candidate molecules 
previously arrayed in the wells of a multi-well plate are incubated with the labeled HTFS, washed, and 
10 any wdls with labeled HTFS complex are assayed Data obtained using different concentrations of 
HTFS are used to calculate values for the number, affinity, and association of HTFS with the candidate 
molecules. 

Alternatively, molecules interacting witiTHTFS are analyzed using the yeast two-hybrid 
system as described in Fields, S. and O. Song (1989, Nature 340:245-246), or using commercially 
15 available kits based on the two-hybrid system, such as the MATCHMAKER system (Clontech). 

HTFS may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT) 
which employs the yeast two-hybrid system in a high-throughput manner to determine all interactions 
between the proteins encoded by two large libraries of genes (Nandabalan, K. et aL (2000) U.S. Patent 
No. 6,057,101). 

20 

Various modifications and variations of the described methods and systems of the invention will 
be apparent to those skilled in the art without departing from the scope and spirit of the invention- 
Although the invention has been described in connection with certain embodiments, it should be 
understood that the invention as claimed should not be unduly limited to such specific embodiments. 
-25 - - Indeed, various modificationsof the described modes for. carrying out the invention which arc.obvious 
to those skilled in molecular biology or related fields are intended to be within the scope of the following 
claims. 
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What is claimed is: 

1. An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of: 

5 a) an amino acid sequence selected from the group consisting of SEQ ID NO: 1-42, 

b) a naturally occurring amino acid sequence having at least 90% sequence identity to an 
amino acid sequence selected from the group consisting of SEQ ID NO:l-42, 

c) a biologically active fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO:l-42, and 

10 d) an immunogenic fragment of an amino acid sequence selected from the group consisting 

of SEQ ID NO: 1-42. 

2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID NO: 1- 

42. 

15 

3. An isolated polynucleotide encoding a polypeptide of claim 1. 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 

20 5. An isolated polynucleotide of claim 4 selected from the group consisting of SEQ ID 

NO:43-84. 

6. A recombinant polynucleotide comprising a promoter sequence opcrably linked to a 
polynucleotide of claim 3. 

25 _ . _ _ _ _ 

. . 7. A cell transformed with a recombinant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

30 9. A method for producing a polypeptide of claim 1 , the method comprising: 

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein said 
cell is transformed with a recombinant polynucleotide, and said recombinant polynucleotide 
comprises a promoter sequence operably linked to a polynucleotide encoding the polypeptide of claim 
l,and 

35 b) recovering the polypeptide so expressed 
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1 0. An isolated antibody which specifically binds to a polypeptide of claim 1 . 

1 1 . An isolated polynucleotide comprising a polynucleotide sequence selected from the 
group consisting of: 

a) a polynucleotide sequence selected from the group consisting of SEQ ID NO:43-84, 

b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:43-84, 

c) a polynucleotide sequence complementary to a), 

d) a polynucleotide sequence complementary to b), and 

e) an RNA equivalent of a)-d). 

12. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 11. 

13. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 1 1, the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, and which probe 
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization 
complex is formed between said probe and said target polynucleotide or fragments thereof, and 

b) detecting the presence or absence of said hybridization complex, and, optionally, if 
present, the amount thereof 

14. A method of claim 1 3, wherein the probe comprises at least 60 contiguous nucleotides. 

15. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 11, the method comprising: 

a) amplifying said target polynucleotide or fragment thereof using polymerase chain reaction 
amplification, and 

b) detecting the presence or absence of said amplified target polynucleotide or fragment 
thereof, and, optionally, if present, the amount thereof. 

16. A composition comprising an effective amount of a polypeptide of claim 1 and a 
pharmaceutical^ acceptable excipient 
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17. A composition of claim 16, wherein the polypeptide comprises an amino acid sequence 
selected from the group consisting of SEQ ID NO:l-42. 

18. A method for treating a disease or condition associated with decreased expression of 

5 functional HTFS, comprising administering to a patient in need of such treatment the composition of 
claim 16. 

19. A method for screening a compound for effectiveness as an agonist of a polypeptide of 
claim 1 , the method comprising: 

L 10 a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting agonist activity in the sample. 

20. A composition comprising an agonist compound identified by a method of claim 19 and 
a pharmaceutical^ acceptable excipient 

15 

21 . A method for treating a disease or condition associated with decreased expression of 
functional HTFS, comprising administering to a patient in need of such treatment a composition of 
claim 20. 

20 22. A method for screening a compound for effectiveness as an antagonist of a polypeptide 

of claim 1 , the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting antagonist activity in the sample. 

25 23. A composition comprising an antagonist compound identified by a method of claim 22 

and a pharmaceutical^ acceptable excipient 

24. A method for treating a disease or condition associated with overexpression of functional 
HTFS, comprising administering to a patient in need of such treatment a composition of claim 23. 

30 

25. A method of screening for a compound that specifically binds to the polypeptide of claim 
1, said method comprising the steps of: 

c a) combining the polypeptide of claim 1 with at least one test compound under suitable 
conditions, and 

35 b) detecting binding of the polypeptide of claim 1 to the test compound, thereby identifying a 
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compound that specifically binds to the polypeptide of claim 1 . 

26. A method of screening for a compound that modulates the activity of the polypeptide of 
claim 1, said method comprising: 

5 a) combining the polypeptide of claim 1 with at least one test compound under conditions 

permissive for the activity of the polypeptide of claim 1, 

b) assessing the activity of the polypeptide of claim 1 in (he presence of the test compound, 

and 

c) comparing the activity of the polypeptide of claim 1 in (he presence of the test compound 
10 with the activity of the polypeptide of claim 1 in the absence of the test compound, wherein a change 

in the activity of the polypeptide of claim 1 in the presence of the test compound is indicative of a 
compound that modulates the activity of the polypeptide of claim 1 . 

27. A method for screening a compound for effectiveness in altering expression of a target 
15 polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the method 

comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, under conditions 
suitable for the expression of the target polynucleotide, 

b) detecting altered expression of the target polynucleotide, and 

20 c) comparing the expression of the target polynucleotide in the presence of varying amounts of 

the compound and in the absence of the compound. 

28. A method for assessing toxicity of a test compound, said method comprising: 
a) treating a biological sample containing nucleic acids with the test compound; 

25 b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at 

least 20 contiguous nucleotides of a polynucleotide of claim 1 1 under conditions whereby a specific 
hybridization complex is formed between said probe and a target polynucleotide in the biological 
sample, said target polynucleotide comprising a polynucleotide sequence of a polynucleotide of claim 
1 1 or fragment thereof ; 

30 c) quantifying the amount of hybridization complex; and 

d) comparing the amount of hybridization complex in the treated biological sample with the 
amount of hybridization complex in an untreated biological sample, wherein a difference in the 
amount of hybridization complex in the treated biological sample is indicative of toxicity of the test 
compound. 
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SEQUENCE LISTING 
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<120> HUMAN TRANSFERASE MOLECULES 

<130> PF-0753 PCT 

<140> To Be Assigned 
<141> Herewith 

<150> 60/163,595 
(151> 1999-11-04 

<160> 84 " . 

<170> PERL Program 

*<210> 1 
<211> 261 

<212> PRT . 
<213> Homo sapiens ^ 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 016233CD1 

<400> 1 

Met Ala Gly Asp Ser Glu Gin Thr Leu Gin Asn His Gin Gin Pro 
1*5 10 15 

Asn Gly Gly Glu Pro Phe Leu lie Gly Val Ser Gly Gly Thr Ala 
20 '25 30 

Ser Gly Lys Ser Ser Val Cys Ala Lys lie Val Gin Leu Leu Gly 
35 40 45 

Gin Asn Glu Val, Asp Tyr Arg Gin Lys Gin Val Val lie Leu Ser 
50 55 60 

Gin Asp Ser Phe Tyr Arg Val Leu Thr Ser Glu Gin Lys Ala Lys 
65 . 70 75 

Ala-Leu-Lys Gly^-Gln Phe- Asn -Phe_ Asp -His Pro_Asp Ala Phe .Asp 
80 85 90 

Asn Glu Leu lie Leu Lys Thr Leu Lys Glu He Thr Glu Gly Lys 
95 100 105 

Thr Val Gin He Pro Val Tyr Asp Phe Val Ser His Ser Arg Lys 
. 110 115 120 

Glu Glu Thr Val Thr Val Tyr Pro Ala Asp Val Val Leu Phe Glu 
125 ' 130 135 

Gly He Leu Ala Phe Tyr Ser Gin Glu Val Arg Asp Leu Phe Gin 
140 . . 145 - 150 

Met Lys Leu Phe Val Asp Thr Asp Ala Asp Thr Arg Leu Ser Arg 
155 160 165 

Arg Val Leu Arg Asp He Ser Glu Arg Gly Arg Asp Leu Glu Gin 
170 175 180 

He Leu Ser Gin Tyr He Thr Phe Val Lys Pro Ala Phe Glu Glu 
185 190 195 

Phe Cys Leu Pro Thr Lys Lys Tyr Ala Asp Val He He Pro Arg 
200 205 210 

Gly Ala Asp Asn Leu Val Ala He Asn Leu lie. Val Gin His He 
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215 

Gin Asp He Leu Asn Gly Gly Pro Ser 
230 

cys Leu Asn Gly Tyr Thr Pro Ser Arg 
245 

Ser Ser Ser Arg Pro His 
260 

<210> 2 

<211> 197 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 078336CD1 



<400> 2 










Met 


Phe 


Tyr 


Leu Val Val 
5 


Phe 


Leu Asn 


1 

Leu 


Tyr 


Asp 


Thr Thr Leu 


Ala 


Leu Gly 






20 






Asp 


Asp 


Cys 


Asp Leu Pro 


Ser 


Ala Gin 






35 






Gly 


Leu 


Gin 


Asn Thr Tyr 


Met 


Gly Tyr 






50 






Ala 


lie 


Lys 


Asp Leu Gly 


lie 


Ser Pro 






65 






Pro 


Gly 


Val 


He Val Ala 


Asn 


Met Thr 






80 






He 


Thr 


Lys 


Gin Leu Glu 


Lys 


Trp Met 




95 






Asn 


Leu 


Tyr 


Ser Ser Ser 


Leu 


Gly Gly 






110 






Met 


Leu. 


lie 


Val Phe His 


Gly 


Lys Tyr 








125 






Trp 


His 


He 


Arg His Leu 


Gly 


Trp Asn 






140 






Glu 


His 


Phe 


Leu Gin Glu 


Ala 


Lys Leu 








155 






His 


Lys 


Pro 


Trp Asp Phe 


Pro 


Ser Val 






170 






Ser 


Trp 


Phe 


Val Pro Asp 


Pro 


Ala Gly 






185 






His 


Ser 











<210> 3 

_<211> 378 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 13 0117CD1 . 

<400> 3 

Met Asp Leu Ala Gly Leu Leu Lys Ser 

1 5 
Val Phe Cys Tyr Val Phe He Ala Ser 
20 

.He Gin Leu Phe Thr Leu Leu Leu Trp 
35 

Phe Arg Lys lie Asn Cys Arg Leu Ser 
50 

Leu Val Met Leu Leu Glu Trp Trp Ser 
65 
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220 




225 


Lys 


Arg Gin Thr Asn 


Gly 


235 




240 


Lys 


Arg Gin Ala Ser 


Glu 


250 




255 


ser 


t»iy Asp lie vji.ii 


lalU 


1U 




1 ^ 


HIS 


Ala Ala Aia rne 




o tr 






Asp 


lie Asn Arg jjeu 


Veil 


a n 






Leu 


Asp Tyr Arg Lys 


Lys 


c c 




O v 


Ser 


Thr Cys Ser Pne 


Asn 


1 0 




/ D 


Glu 


Trp Lys his tain 


Arg 


Q C 

OS 


Q n 


Gin 


Lys Asn vai giu 


bill 


100 


1 nc 


Gly 


Val Ala Tnr ber 


Fro 


115 




1Z u 


ser 


mr lie Asn rro 


Leu 


fin 
lJ 0 




i 7C 

1 J D 


Pro 


Asp Ala Arg Tyr 


Ser 


145 




150 


Leu 


His Trp Asn Gly 


Arg 


160 


165 


His 


Asn Asp Leu Trp 


Glu 


175 


180 


He 


Phe Lys Leu Asn 


His 


190 




195 


V9XX1 


phn T\<=»11 cvs His 


Leu 


10 




15 


GlV 
\?±y 


Tj^u Tie He Asn 


Thr 


25 




30 


Pro 


He Asn Lys Gin 


Leu 


40 


45 


Tyr 


Cys He Ser Ser 


Gin 


55 




60 


Gly 


Thr Glu Cys Thr 


He 


70 




75 
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Phe Thr Asp Pro Arg Ala Tyr Leu Lys Tyr Gly Lys Glu Asn Ala 
80 85 90 

He Val Val Leu Asn His Lys Phe Glu He Asp Phe Leu Cys Gly 
95 100 105 

Trp Ser Leu Ser Glu Arg Phe Gly Leu Leu Gly Gly Ser Lys Val 

110 115 120 

Leu Ala Lys Lys Glu Leu Ala Tyr Val Pro He He Gly Trp Met 

125 130 135 

Trp Tyr Phe Thr Glu Met Val Phe Cys Ser Arg Lys Trp Glu Gin 

140 145 150 

Asp Arg Lys Thr Val Ala Thr Ser Leu Gin His Leu Arg Asp Tyr 

155 160 165 

Pro Glu Lys Tyr Phe Phe Leu He His Cys Glu Gly Thr Arg Phe 

170 175 180 

Thr Glu Lys Lys His Glu lie Ser Met Gin Val Ala Arg Ala Lys 

185 190 195 

Glv Leu Pro Arg Leu Lys His His Leu Leu Pro Arg Thr Lys Gly 
* 200 205 210 

Phe Ala He Thr Val Arg Ser Leu Arg Asn Val Val Ser Ala Val 

215 220 225 

Tvx Asp Cvs Thr Leu Asn Phe Arg Asn Asn Glu Asn Pro Thr Leu 

230 235 240 

Leu Gly Val Leu Asn Gly Lys Lys Tyr His Ala Asp Leu Tyr Val 

245 250 255 

Arg Arg lie Pro Leu Glu Asp lie Pro Glu Asp Asp Asp Glu Cys 

260 265 270 

Ser Ala Trp Leu His Lys Leu Tyr Gin Glu Lys Asp Ala Phe Gin 

275 280 285 

Glu Glu Tyr Tyr Arg Thr Gly Thr Phe Pro Glu Thr Pro Met Val 

290 295 300 

Pro Pro Arg Arg Pro Trp Thr Leu Val Asn Trp Leu Phe Trp Ala 

305 310 315 

Ser Leu Val Leu Tyr Pro Phe Phe Gin Phe Leu Val Ser Met He 

320 325 330 

Arg Ser Gly Ser Ser Leu Thr Leu Ala Ser Phe He Leu Val Phe 

335 340 345 

Phe Val Ala Ser Val Gly Val Arg Trp Met lie Gly Val Thr Glu 

350 355 360 

He Asp Lys Gly Ser Ala Tyr Gly Asn Ser Asp Ser Lys Gin Lys 
» 365 370 375 

Leu Asn Asp 

<210> 4 
<211> 285 
<212> PRT 

<213> Homo sapiens 

<221> raisc_feature * 
<223> Incyte ID No: 267495CD1 

<400> 4 

t\7t r-on Tie Glv Leu Gly Leu Gly Asp Ala Lys Asp He 

10 15 

1 Arg Arg Cys Ser Arg Val Tyr 

25 30 

u Thr Val Gly Lys Glu Ala Leu 

40 45 

a Val Val Ala Asp Arg Glu Glu 

55 60 

e Leu Lys Asp Ala Asp He Ser 

70 75 

y Asp Pro Phe Gly Ala Thr Thr 

85 90 

_a Thr Lys Leu Gly He Pro Tyr 

95 100 105 



Met 


Leu 


Tyr 


Leu 


lie Gly 


Leu 


1 






5 




Thr 


Val 


Lys 


Gly 


Leu Glu 


Val 






20 




Leu 


Glu 


Ala 


Tyr 


Thr Ser 


Val 








35 




Glu 


Glu 


Phe 


Tyr 


Gly Arg 
50 


Lys 


Val 


Glu 


Gin 


Glu 


Ala Asp 
65 


Asn 


Asp 


Val 


Ala 


Phe 


Leu Val 


Val 








80 




His 


Ser 


Asp 


Leu 


Val Leu 


Arg 
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Arg 


Val 


lie 


His 


Asn 


Ala 


be! 


lie 


Met Asn 


Ala U a 1 r*\/€2 

Ala Val Valy Vy5 


Cys 








110 








11C 

JLX3 




lo u 


Gly 


Leu 


Gin 


Leu 


Tyr 


Lys 




c»iy 


taiu inr 


u a l Cay Tlo T7a 1 

. vai ier i ie vai 


rue 








125 








1 *vo 




135- 

X J -J 


Trp 


Thr 


Asp 


Thr 


Trp 


Arg 


Pro 


G1U 


ber rile 


.rne Asp i>ys vai 


Lys 






140 








145 




150 


* Lys 


Asn 


Arg 


Gin Asn 


Giy 


Met. 


HIS 


inr ueu 


f*\rc T ait T on Ion 

Lys Lieu ijeu Ac>p 


lie 






155 








1 fin 

lOU 




1 65 

IDJ 


Lys 


Val 


Lys 


Glu 


Gin 


ser 


Leu 


bill 


Asn beu 


Tin T arc* ^ 1 \ r A y/*T 
lie JjyS biy niy 


T A/O 






170 








175 
i / o 




l an 


He 


Tyr 


Glu 


Pro 


Pro 


Arg 


Tyr 


Met 


ber vai 


Asn c^in Aia Aia 


bin 








185 






1 Oft 




1 95 

173 


Gin 


Leu 


Leu 


Glu 


He 
200 


Val 


Gin 


Asn 


Gin Arg 

Z Ud 


Tin 7N vrr Plit pill 

lie Arg v»iy giu 


pi,, 
(9IU 

Z1U 


Pro 


Ala 


Val 


Thr Glu 


Glu 


Thr 


Leu 


Cys Val 


Gly Leu Ala Arg 


vai 










215 








220 




225 


Gly 


Ala 


Asp 


Asp 


Gin 


Lys 




Aia 


Ala ijly 


inr JjcU Axry vjiii 


Mot" 




230 






235 




240 


Cys 


Thr 


Val 


Asp 


Leu 


Gly 


Glu 


Pro 


Leu His 


Ser Leu He lie 


Thr 






245 






250 




255 


Gly Gly 


Ser 


He 


His 


Pro 


Met 


Glu 


Met Glu 


Met Leu Ser Leu 


Phe 










260 








265 




270 


Ser 


lie 


Pro 


Glu 


Asn 
275 


Ser 


Ser 


Glu 


Ser Gin 
280 


Ser He Asn Gly 


Leu 
285 



<210> 5 
<211> 301 
<212> PRT 
/ <213> Homo sapiens 

. <220> 
<221> misc_feature 
<223> Incyte ID No: 410533CD1 



<400> 5 

Met Leu Ala Leu Arg Val Ala Arg Gly Ser Trp Gly Ala Leu Arg 
15 10 15 

Gly Ala Ala Trp Ala Pro Gly Thr Arg Pro Ser Lys Arg Arg Ala 
20 25 30 

cys Trp Ala Leu Leu Pro Pro Val Pro Cys Cys Leu Gly cys Leu , 

35 40 45 

Ala Glu Arg Trp Arg Leu Arg Pro Ala Ala Leu Gly Leu Arg Leu 
50 55 60 

Pro Gly lie Gly Gin Arg Asn His Cys Ser Gly Ala Gly Lys Ala 
65 ~ 70 75 

Ala Pro Arg Pro Ala Ala Gly Ala Gly Ala Ala Ala Glu Ala Pro 
80 85 90 

-Gly -Gly Gln-Trp -Gly- Pro - Ala -Ser-Thr -Pro -Ser- Leu Tyr-Glu-Asn— - 

95 100 105 

Pro Trp Thr lie Pro Asn Met Leu Ser Met Thr Arg lie Gly Leu 
110. 115 120 

Ala Pro Val Leu Gly Tyr Leu lie lie Glu Glu Asp Phe Asn lie 
125 130 135 

Ala Leu Gly Val Phe Ala Leu Ala Gly Leu Thr Asp Leu Leu Asp 
" - 14cr --■ * 14 5 150 

Gly Phe He Ala Arg Asn Trp Ala Asn Gin Arg Ser Ala Leu Gly 

155 160 165 

Ser Ala Leu Asp Pro Leu Ala Asp Lys lie Leu He Ser lie Leu 

170 175 180 

Tyr Val Ser Leu Thr Tyr Ala Asp Leu He Pro Val Pro Leu Thr 

185 190 195 

Tyr Met He He Ser Arg Asp Val Met Leu lie Ala Ala Val Phe 

200 205 210 

Tyr Val Arg Tyr Arg Thr Leu Pro Thr Pro Arg Thr Leu Ala Lys 

215 . 220 225 

Tyr Phe Asn Pro Cys Tyr Ala Thr Ala Arg Leu Lys Pro Thr Phe 
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Z J u 








235 


240 


Tl rt p-y. Tire T 7-a 1 

lie ser .Lys vai 


TV eT> 


Thr 


Ala 


Val Gin 


Leu He Leu Val Ala 


Ala 


245 








250 


255 


Otii JjcU Ala ttl a 


Pro 


Val 


Phe 


Asn Tyr 


Ala Asp Ser He Tyr 


Leu 




260 








265 


270 


bin 116 Leu irp 


Cys 


Phe 


Thr 


Ala Phe 


Thr Thr Ala Ala Ser 


Ala 


Z / D 








280 


285 


iyr ser Tyr iyr 


MIS 


Tyr Gly Arg Lys 


Thr Val Gin Val He 


Lvs 


290 








295 


300 


Asp 














<210> 6 














<211> 253 














<212> PRT 














<213> Homo sapiens 












<220> 














<221> misc_feature 












<223> Incyte ID 


No: 


852708CD1 






<400> 6 










ser Giu ser tain t*iy 


LrlU 


Met Ala Thr Ser 


Gly 


Asp 


Cys 


Pro Arg 


1 


5 




IV 




Glu Pro Ala Glu 


Cys 


Ser 


Glu 


Ala Gly 


Leu Leu Gin Glu Gly 


vai 




20 








ZD 




Gin Pro Glu Glu 


Phe 


Val 


Ala 


He Ala 


Asp Tyr Ala Ala Thr 


Asp 


35 








a n 




Glu Thr Gin Leu 


Ser 


Phe 


Leu 


Arg Gly 


Glu Lys lie Leu lie 


Leu 




50 










o u 


Arg Gin Thr Thr 


Ala 


Asp 


Trp 


Trp Trp 


Gly Glu Arg Ala Giy 


Cys 


65 






/U 


/ o 


Cys Gly Tyr lie 


Pro 


Ala 


Asn 


His Val 


Gly Lys His val Asp 


bill 


80 








bb 


OA 


Tyr Asp Pro Glu 


Asp 


Thr 


Trp 


Gin Asp 


Glu Glu Tyr Phe Gly 


ber 


95 








100 


lub 


Tyr Gly Thr Leu 


Lys 


Leu 


His 


Leu Glu 


Met Leu Ala Asp tin 


pro 


110 








115 


ion 

1ZU 


Arg Thr Thr Lys 


Tyr 


His 


Ser 


Val He 


Leu Gin Asn Lys oiu 


»>er 


125 








loU 




Leu Thr Asp Lys 


Val 


He 


Leu 


Asp Val 


Giy cys Giy inr oiy 


Tl A 

lie 


140 








14b 




He Ser Leu Phe 


Cys 


Ala 


His 


Tyr Ala 


Arg Pro Arg Aia vai 


Tyr 




155 






160 


165 


Ala Val Glu Ala 


Ser 


Glu 


Met 


Ala Gin 


His Thr Gly Gin Leu 


Val 




170 








175 


180 


Leu Gin Asn Gly 


Phe 


Ala 


Asp 


He He 


Thr Val Tyr Gin Gin 


Lys 


185 






190 


195 


Val -Glu- Asp Val 


Val- 


Leu 


Pro-Glu -Lys- 


Val Asp Val Leu Val 


Ser- 


200 








205 


210 


Glu Trp Met Gly 


Thr 


Cys 


Leu Leu Val Arg Ala Gly Val Arg Ala 


215 






220 


225 


Ala Gly Gly Arg 


Ser 


Trp 


Gly Ala Ser Glu His Gly Leu Gly Trp 


230 








235 


240 


Ala Asn Leu Arg 


He 


Ser 


Arg Val Val Arg Asp Ser Phe 




245 








250 





<210> 7 

<211> 390 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> raisc_feature 

<223> Incyte ID No: 972944CD1 

<400> 7 
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Met Ser Val Arg Val Ala Arg Val Ala Trp Val Arg Gly Leu Gly 
1 5 10 . 15 

Ala Ser Tyr Arg Arg Gly Ala Ser Ser Phe Pro Val Pro Pro Pro 
20 25 30 

Gly Ala Gin Gly Val Ala Glu Leu Leu Arg Asp Ala Thr Gly Ala 
35 40 45 

Glu Glu Glu Ala Pro Trp Ala Ala Thr Glu Arg Arg Met Pro Gly 
50 55 . 60 

Gin Cys Ser Val Leu Leu Phe Pro Gly Gin Gly Ser Gin Val Val 
65 70 75 

Gly Met Gly Arg Gly Leu Leu Asn Tyr Pro Arg Val Arg Glu Leu 
80 85 90 

Tyr Ala Ala Ala Arg Arg Val Leu Gly Tyr Asp Leu Leu Glu Leu 
95 100 105 

Ser Leu His Gly Pro Gin Glu Thr Leu Asp Arg Thr Val His Cys 

110 115 120 

Gin Pro Ala lie Phe Val Ala Ser Leu Ala Ala Val Glu Lys Leu 

125 130 135 

His His Leu Gin Pro Ser Val He Glu Asn Cys Val Ala Ala Ala 

140 145 150 

Gly Phe Ser Val Gly Glu Phe Ala Ala Leu Val Phe Ala Gly Ala 

155 160 165 

Met Giu Phe Ala Glu Gly Leu Tyr Ala val Lys He Arg Ala Glu 

170 175 180 

Ala Met Gin Glu Ala Ser Glu Ala Val Pro Ser Gly Met Leu Ser 

185 190 195 

Val Leu Gly Gin Pro Gin Ser Lys Phe Asn Phe Ala Cys Leu Glu 

200 205 210 

Ala Arg Glu His Cys Lys ser Leu Gly He Glu Asn Pro Val Cys 

215 ■ 220 225 

Glu Val Ser Asn Tyr Leu Phe Pro Asp Cys Arg Val He Ser Gly 

230 235 240 

His Gin Glu Ala Leu Arg Phe Leu Gin Lys Asn Ser Ser Lys Phe 

245 250 255 

His Phe Arg Arg Thr Arg Met Leu Pro Val Ser Gly Ala Phe His 

260 265 270 

Thr Arg Leu Met Glu Pro Ala Val Glu Pro Leu Thr Gin Ala Leu 

275 280 285 

Lys Ala Val Asp He Lys Lys Pro Leu Val Ser Val Tyr Ser Asn 

.290 295 300 

Val His Ala His Arg Tyr Arg His Pro Gly His He His Lys Leu 

305 310 315 

Leu Ala Gin Gin Leu Val Ser Pro Val Lys Trp Glu Gin Thr Met 

320 325 " 330 

His Ala He Tyr Glu Arg Lys Lys Gly Arg Gly Phe Pro Gin Thr 

335 340 345 

Phe Glu Val Gly Pro Gly Arg Gin Leu Gly Ala He Leu Lys Ser 

350 ~ 355 360 

Cys Asn Met Gin Ala Trp Lys_ Ser Tyr^ Ser Ala Val_Asp Val_ Leu 
' " 365 * 370 375 

Gin Thr Leu Glu His Val Asp Leu Asp Pro Gin Glu Pro Pro Arg 

380 385 390 



<210> 8 

<211> 373 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 99773 0CD1 

<400> 8 

Met Leu He Pro Phe Ser Met Lys Asn Cys Phe Gin Leu Leu cys 
15 10 15 

Asn Cys Gin Val Pro Ala Ala Gly Phe Lys Lys Thr Val Lys Asn 
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20 25 30 

Gly Leu lie Leu Gin Ser He Ser Asn Asp Val Tyr Gin Asn Leu 
35 40 45 

Ala Val Glu Asp Trp He His Asp His Met Asn Leu Glu Gly Lys 
50 55 60 

Pro He Leu Phe Phe Trp Gin Asn Ser Pro Ser Val Val He Gly 
65 70 75 

Arg His Gin Asn Pro Trp Gin Glu Cys Asn Leu Asn Leu Met Arg 
80 85 90 

Glu Glu Gly lie Lys Leu Ala Arg Arg Arg Ser Gly Gly Gly Thr 
95 100 105 

Val Tyr His Asp Met Gly Asn He Asn Leu Thr Phe Phe Thr Thr 

HO 115 120 

Lys Lys Lys Tyr Asp Arg Met Glu Asn Leu Lys Leu He Val Arg 

125 130 135 

Ala Leu Asn Ala Val Gin Pro Gin Leu Asp Val Gin Ala Thr Lys 

140 145 150 

Ara Phe Asp Leu Leu Leu Asp Gly Gin Phe Lys He Ser Gly Thr 

155 160 165 

Ala Ser Lys He Gly Arg Thr Thr Ala Tyr His His Cys Thr' Leu 
. 170 175 180 

Leu Cvs ser Thr Asp Gly Thr Phe Leu Ser Ser Leu Leu Lys Ser 

185 " 190 195 

Pro Tyr Gin Gly He Arg Ser Asn Ala Thr Ala Ser He Pro Ser 

200 205 210 

Leu Val Lys Asn Leu Leu Glu Lys Asp Pro Thr Leu Thr Cys Glu 

215 220 225 

Val Leu Met Asn Ala Val Ala Thr Glu Tyr Ala Ala Tyr His Gin 

230 235 240 

He Asp Asn His He His Leu He Asn Pro Thr Asp Glu, Thr Leu 

245 250 255 

Phe Pro Gly He Asn Ser Lys Ala Lys Glu Leu Gin Thr Trp Glu 

260 265 270 

Trp He Tyr Gly Lys Thr Pro Lys Phe Ser He Asn Thr Ser Phe 

275 280 285 

His Val Leu Tyr Glu Gin Ser His Leu Glu He Lys Val Phe He 

290 295 300 

Asp lie Lys Asn Gly Arg He Glu He Cys Asn He Glu Ala Pro 

305 310 315 

Asp His Trp Leu Pro Leu Glu He Arg Asp Lys Leu Asn Ser Ser 

320 325 330 

Leu lie Gly Ser Lys Phe Cys Pro Thr Glu Thr Thr Met Leu Thr 

335 340 345 

Asn He Leu Leu Arg Thr Cys Pro Gin Asp His Lys Leu Asn Ser 

350 355 360 

Lys Trp Asn He Leu Cys Glu Lys He Lys Gly He Met 

365 370 

--<2-10> 9 — - -- - ----- 

<211> 371 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte f ID No: 1285944CD1 

<400> 9 

Met Ser Phe Arg Lys Val Asn He He He Leu Val Leu Ala Val 
15 10 15 

Ala Leu Phe Leu Leu Val Leu His His Asn Phe Leu Ser Leu Ser 

20 25 30 

Ser Leu Leu Arg Asn Glu Val Thr Asp Ser Gly lie Val Gly Pro 

35 . 40 . . 45 

Gin Pro He Asp Phe Val Pro Asn Ala Leu Arg His Ala Val Asp 

50 55 60 

Gly Arg Gin Glu Glu He Pro Val Val He Ala Ala Ser Glu Asp 
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65 70 75 

Arg Leu Gly Gly Ala lie Ala Ala He Asn Ser He Gin His Asn 
80 85 90 

Thr Arg Ser Asn Val lie Phe Tyr lie Val Thr Leu Asn Asn Thr 
95 100 105 

Ala Asp His Leu Arg Ser Trp Leu Asn Ser Asp Ser Leu Lys Ser 

110 115 120 

He Arg Tyr Lys He Val Asn Phe Asp Pro Lys Leu Leu Glu Gly 

125 130 135 

Lys Val Lys Glu Asp Pro Asp Gin Gly Glu Ser Met Lys Pro Leu 

140 145 150 

Thr Phe Ala Arg Phe Tyr Leu Pro lie Leu Val Pro Ser Ala Lys 

155 160 165 

Lys Ala He Tyr Met Asp Asp Asp Val He Val Gin Gly Asp He 

170 175 180 

Leu Ala Leu Tyr Asn Thr Ala Leu Lys Pro Gly His Ala Ala Ala 

185 190 195 

Phe Ser Glu Asp Cys Asp Ser Ala Ser Thr Lys Val Val He Arg 

200 205 210 

Gly Ala Gly Asn Gin Tyr Asn Tyr He Gly Tyr Leu Asp Tyr Lys 

215 220 225 

Lys Glu Arg He Arg Lys Leu Ser Met Lys Ala Ser Thr Cys Ser 

230 235 240 

Phe Asn Pro Gly Val Phe Val Ala Asn Leu Thr Glu Trp Lys Arg 

245 250 255 

Gin Asn He Thr Asn Gin Leu Glu Lys Trp Met Lys Leu Asn Val 

260 265 270 

Glu Glu Gly Leu Tyr Ser Arg Thr Leu Ala Gly Ser He Thr Thr 

275 280 285 

Pro Pro Leu Leu He Val Phe Tyr Gin Gin His Ser Thr He Asp 

290 295 300 

Pro Met Trp Asn Val Arg His Leu Gly Ser Ser Ala Gly Lys Arg 

305 310 315 

Tyr Ser Pro Gin Phe Val Lys Ala Ala Lys Leu Leu His Trp Asn 

320 325 330 

Gly His Leu Lys Pro Trp Gly Arg Thr Ala Ser Tyr Thr Asp Val 

335 340 345 

Trp Glu Lys Trp Tyr He Pro Asp Pro Thr Gly Lys Phe Asn Leu 

350 355 360 

He Arg Arg Tyr Thr Glu He Ser Asn He Lys 

365 370 

<210> 10 
<211> 123 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature _ _ _ „ _ _ . .. .... 

<223> Incyte ID No: 12932 07CD1 

<400> 10 

Met Phe Asn Phe Asp Thr Phe Trp Lys Asn Phe Lys Ser Lys Leu 

1 5 10 15 

Gly Phe He Asn Trp Asp Ala He Asn Lys Asn Gin Val Pro Pro 

20 25 - 30 

Pro Ser Thr Arg Ala Leu Leu Tyr Phe Ser Arg Leu Trp Glu Asp 

35 40 45 

Phe Lys Gin Asn Thr Pro Phe Leu Asn Trp Lys Ala He lie Glu 

50 55 60 

Gly Ala Asp Ala Ser Ser Leu Gin Lys Arg Ala Gly Arg Ala Asp 

65 70 75 

Gin Asn Tyr Asn Tyr Asn Gin His Ala Tyr Pro Thr Ala Tyr Gly 

80 85 90 

Gly Lys Tyr Ser Val Lys Thr Pro Ala Lys Gly Gly Val Ser Pro 

95 100 105 

Ser Ser Ser Ala Ser Arg Val Gin Pro Gly Leu Leu Gin Trp Val 
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110 115 120 

Lys Phe Trp 



<210> 11 

<211> 85 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1308125CD1 

<400> 11 

Met Ser Ser Ser Arg Met Glu Gly Lys Ala Lys Tyr He Leu Pro 
1 5 . 10 15 

Thr Glu Thr He Tyr Val Gly Glu Met Lys Asp Gly Met Phe His 

20 25 30 

Gly Glu Gly Thr Leu Tyr Phe Pro Ser Gly Ser Gin Tyr Asp Ala 

35 40 45 

lie Trp Glu Asn Gly Leu Ala He Lys Val Trp Leu Asn Ser Pro 

50 55 60 

lie Trp Thr His Leu Glu Lys Ser Pro Arg Ala lie Thr lie Val 

65 70 75 

Glu Thr Ala Ser lie Thr Gin Ser Arg Gly 

80 85 

<210> 12 

<211> 184 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1439670CD1 ' 
<400> 12 

Met Lys Pro Asp Glu Thr Pro Met Phe Asp Pro ser Leu Leu Lys 
1 5 10 15 

Glu Val Asp Trp Ser Gin Asn Thr Ala Thr Phe Ser Pro Ala lie 
20 25 30 

Ser Pro Thr His Pro Gly Glu Gly Leu Val Leu Arg Pro Leu Cys 
35 40 45 

Thr Ala Asp Leu Asn Arg Gly Phe Phe Lys Val Leu Gly Gin Leu 
50 55 60 

Thr Glu Thr Gly Val Val Ser Pro Glu Glh Phe Met Lys Ser Phe 
65 70 75 

Glu His Met Lys Lys Ser Gly Asp Tyr Tyr Val Thr Val Val Glu 

. ._L- _ ___80 - 85_„_ - — ._.'9<L 

Asp Val Thr Leu Gly Gin He Val Ala Thr Ala Thr Leu lie He 
95 " • " 100 - 105 

Glu His Lys Phe lie His Ser Cys Ala Lys Arg Gly Arg Val Glu 
110 115 120 

Asp Val Val Val Ser Asp Glu Cys Arg Gly Lys Gin Leu Gly Lys 
125 130 135 

Leu Leu Leu Ser Thr Leu Thr Leu Leu Ser Lys Lys Leu Asn- Cys 
140 145 150 

Tyx Lys lie Thr Leu Glu Cys Leu Pro Gin Asn Val Gly Phe Tyr 
155 160 165 

Lys Lys Phe Gly Tyr Thr Val Ser Glu Glu Asn Tyr Met Cys Arg 
170 175 180 

Arg Phe Leu Lys 



<210> 13 
<211> 169 
<212> PRT 
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<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 14442 81CD1 

<400> 13 

Met Ala Asn Tyr lie His Val Pro Pro Gly Ser Pro Glu Val Pro 

1 5 10 15 

Lys Leu Asn Val Thr Val Gin Asp Gin Glu Glu His Arg Cys Arg 

20 25 30 

Glu Gly Ala Leu Ser Leu Leu Gin His Leu Arg Pro His Trp Asp 

35 40 45 

Pro Gin Glu Val Thr Leu Gin Leu Phe Thr Asp Gly lie Thr Asn 

50 55 60 

Lys Leu lie Gly cys Tyr Val Gly Asn Thr Met Glu Asp Val Val 

65 70 75 

Leu Val Arg lie Tyr Gly Asn Lys Thr Glu Leu Leu Val Asp Arg 

80 85 90 
Asp Glu Glu Val Lys Ser Phe Arg Val Leu Gin Ala His Gly Cys 

95 100 105 
Ala Pro Gin Leu TVr Cvs Thr Phe Asn Asn Glv Leu Cys Tyr Glu 

110 115 120 

Phe lie Gin Gly Glu Ala Leu Asp Pro Lys His Val Cys Asn Pro 

125 130 135 
Ala lie Phe Ser Leu Ser Ser Leu Thr Leu cys Lys Gly Lys Thr 

140 145 150 
Thr Arg Cys Phe Gly Leu Thr Gly Cys Arg Gly Ser Arg Leu Leu 

155 160 165 

Leu Ser Phe Phe 



<210> 14 
<211> 357 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1450140CD1 

<400> 14 

Met Gly Gly Ala Val Ser Ala Gly Glu Asp Asn Asp Asp Leu He 

1 5 10 15 

Asp Asn Leu Lys Glu Ala Gin Tyr He Arg Thr Glu Arg Val Glu 

20 25 3 0 

Gin Ala Phe Arg Ala He Asp Arg Gly Asp Tyr Tyr Leu Glu Gly 

35 40 45 
-Tyr -Arg-Asp Asn Ala Tyr Lys- Asp Leu Ala-Trp -Lys His Gly- Asn 

50 55 60 

lie His Leu Ser Ala Pro Cys He Tyr Ser Glu Val Met Glu Ala 

65 70 75 

Leu Lys Leu Gin Pro Gly Leu Ser Phe Leu Asn Leu Gly Ser Gly 

80 85 90 

Thr Gly Tyr Leu Ser Thr Met Val Gly Leu He Leu Gly Pro Phe 

95 * 100 105 

Gly He Asn His Gly He Glu Leu His Ser Asp Val Val Glu Tyr 

110 115 120 

Ala Lys Glu Lys Leu Glu Ser Phe lie Lys Asn Ser Asp Ser Phe 

125 130 135 

Asp Lys Phe Glu Phe Cys Glu Pro Ala Phe Val Val Gly Asn Cys 

140 145 150 

Leu Gin He Ala Ser Asp ser His Gin Tyr Asp Arg He Tyr Cys 

155 160 165 

Gly Ala Gly Val Gin Lys Asp His Glu Asn Tyr Met Lys He Leu 

170 175 180 

Leu Lys Val Gly Gly He Leu Val Met Pro He Glu Asp Gin Leu 
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185 190 195 

Thr Gin He Met Arg Thr Gly Gin Asn Thr Trp Glu Ser Lys Asn 

200 205 210 

He Leu Ala Val Ser Phe Ala Pro Leu val Gin Pro ser Lys Asn 

215 220 225 

Asp Asn Gly Lys Pro Asp Ser Val Gly Leu Pro Pro Cys Ala Val 

* 230 235 240 
Arg Asn Leu Gin Asp Leu Ala Arg He Tyr He Arg Arg Thr Leu 

245 250 255 

Arg Asn Phe He Asn Asp Glu Met Gin Ala Lys Gly He Pro Gin 

260 265 270 

Arg Ala Pro Pro Lys Arg Lys Arg Lys Arg Val Lys Gin Arg lie 

275 280 285 

Asn Thr Tyr Val Phe Val Gly Asn Gin Leu He Pro Gin Pro Leu 

290 ~ 295 300 

Asp Ser Glu Glu Asp Glu Lys Met Glu Glu Asp He Lys Glu Glu 

* 305 310 315 
Glu Glu Lys Asp His Asn Glu Ala Met Lys Pro Glu Glu Pro Pro 

320 325 330 

Gin Asn Leu Leu Arg Glu Lys lie Met Lys Leu Pro Leu Pro Glu 

335 340 345 

Ser L*u Lvs Ala TVr Leu Thr Tyr Phe Arg Asp Lys 

350 355 

<210> 15 

<211> 100 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> raisc_feature 

<223> Incyte ID No: 1604828CD1 

<400> 15 s 
Met Asn Val Arg Gly Lys . Val He Leu Ser . Met Leu Val Val Ser 
1 5 10 15 

Thr Val He He Val Phe Trp Glu Phe He Asn Ser Thr Glu Asp 

20 25 30 

Ser Phe Leu Trp He Tyr His Ser Lys Asn Pro Glu Val Asp Asp 

35 40 45 

Ser Ser Ala Gin Lys Gly Trp Trp Phe Leu ser Trp Phe Asn Asn 

50 55 60 

Gly He His Asn Tyr Gin Gin Gly Glu Glu Asp He Asp Lys Glu 

65 70 75 

Lvs Gly Arg Glu Glu Thr Lys Gly Arg Lys Met Thr Gin Gin Ser 

80 85 90 

Phe Gly Tyr Gly Thr Gly Leu He Gin Thr 

95 100 

<210> 16 

<211> 199 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1644023CD1 

<400> 16 

Met Lys Thr Phe He He Gly He Ser Gly Val Thr Asn Ser Gly 

15 10 15 

Lys Thr Thr Leu Ala Lys Asn Leu Gin Lys His Leu Pro Asn Cys 

20 25 30 

Ser Val He Ser Gin Asp Asp Phe Phe Lys Pro Glu Ser Glu He 

35 " 40 45 

Glu Thr Asp Lys Asn Gly Phe Leu Gin Tyr Asp Val Leu Glu Ala 

50 55 60 
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Leu Asn Met Glu Lys Met Met Ser Ala He Ser Cys Trp Met Glu 
65 70 75 

Ser Ala Arg His Ser Val Val Ser Thr Asp Gin Glu Ser Ala Glu 
80 85 90 

Glu lie Pro He Leu He lie Glu Gly Phe Leu Leu Phe Asn Tyr 
95 100 105 

Lys Pro Leu Asp Thr He Trp Asn Arg Ser Tyr Phe Leu Thr He 

110 H5 120 

Pro Tyr Glu Glu Cys Lys Arg Arg Arg Ser Thr Arg Val Tyr Gin 

125 130 135 

Pro Pro Asp Ser Pro Gly Tyr Phe Asp Gly His Val Trp Pro Met 

140 145 150 

Tyr Leu Lys Tyr Arg Gin Glu Met Gin Asp He Thr Trp Glu Val 

155 160 165 

Val Tyr Leu Asp Gly Thr Lys Ser Glu Glu Asp Leu Phe Leu Gin 

170 175 180 

Val Tyr Glu Asp Leu He Gin Glu Leu Ala Lys Gin Lys Cys Leu 

185 190 195 

Gin Val Thr Ala 

<210> 17 

<211> 244 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 1723402CD1 

<400> 17 

Met Glu Leu Thr lie Phe He Leu Arg Leu, Ala He Tyr He Leu 

1 5 10 15 

Thr Phe Pro Leu Tyr Leu Leu Asn Phe Leu Gly Leu Trp Ser Trp 

20 25 30 

He Cys Lys Lys Trp Phe Pro Tyr Phe Leu Val Arg Phe Thr Val 

35 40 45 

He Tyr Asn Glu Gin Met Ala Ser Lys Lys Arg Glu Leu Phe Ser 

50 55 60 

Asn Leu Gin Glu Phe Ala Gly Pro Ser Gly Lys Leu Ser Leu Leu 

65 70 75 

Glu Val Gly Cys Gly Thr Gly Ala Asn Phe Lys Phe Tyr Pro Pro 

80 85 90 

Glv Cys Arg Val Thr Cys He Asp Pro Asn Pro Asn Phe Glu Lys 

95 100 105 

Phe Leu He Lys Ser He Ala Glu Asn Arg His Leu Gin Phe Glu 

.110 115 120 

Arg Phe Val Val Ala Ala Gly Glu Asn Met His Gin Val Ala Asp 

.-. — — — ----- -125- ----- 130 135 

Glv Ser Val Asp Val Val Val Cys Thr Leu Val Leu Cys Ser Val 

" 140 145 150 

Lys Asn Gin Glu Arg He Leu Arg Glu Val Cys Arg Val Leu Arg 

155 160 165 

Pro Gly Gly Ala Phe Tyr Phe Met Glu His Val Ala Ala Glu Cys 

170 175 180 

Ser Thr Trp Asn IVr Phe Trp Gin Gin Val Leu Asp Pro Ala Trp 

185 190 195 

His Leu Leu Phe Asp Gly Cys Asn Leu Thr Arg Glu Ser Trp Lys 

200 205 210 

Ala Leu Glu Arg Ala Ser Phe Ser Lys Leu Lys Leu Gin His He 

215 220 225 

Gin Ala Pro Leu Ser Trp Glu Leu Val Arg Pro His He Tyr Gly 

230 235 240 

Tyr Ala Val Lys 
<210> 18 
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<211> 358 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1740585CD1 

<400> 18 

Met Lys Thr Ala Glu Asn He Arg Gly Thr Gly ser Asp Gly Pro 

i s io ; 15 

Arg Lys Arg Gly Leu Cys Val Leu Cys Gly Leu Pro Ala Ala Gly 
20 25 30 

Lys Ser Thr Phe Ala Arg Ala Leu Ala His Arg Leu Gin Gin Glu 
35 40 45 

Gin Gly Trp Ala He Gly Val Val Ala Tyr Asp Asp Val Met Pro 
50 55 60 

Asp Ala Phe Leu Ala Gly Ala Arg Ala Arg Pro Ala Pro Ser Gin 
65 70 75 

Trp Lys Leu Leu Arg Gin Glu Leu Leu Lys Tyr Leu Glu Tyr Phe 
80 85 90 

au w st A i a val He Asn Gly Cys Gin Met Ser Val Pro Pro Asn 
95 " " 100 105 

Arg Thr Glu Ala Met Trp Glu Asp Phe He Thr Cys Leu Lys Asp 
110 115 120 

Gin Asp Leu He Phe Ser Ala Ala Phe Glu Ala Gin ser cys Tyr 
125 130 135 

Leu Leu Thr Lys Thr Ala Val Ser Arg Pro Leu Phe Leu Val Leu 
140 145 150 

Asp Asp Asn Phe Tyr Tyr Gin Ser Met Afg Tyr Glu Val Tyr Gin 
155 160 165 

Leu Ala Arg Lys Tyr Ser Leu Gly Phe Cys Gin Leu Phe Leu Asp 
170 175 180 

Cys Pro Leu Glu Thr Cys Leu Gin Arg Asn Gly Gin Arg Pro Gin 
185 190 195 

Ala Leu Pro Pro Glu Thr He His Leu Met Arg Arg Lys Leu Glu 
200 205 210 

Lys Pro Asn Pro Glu Lys Asn Ala Trp Glu His Asn Ser Leu Thr 
215 220 225 

He Pro Ser Pro Ala Cys Ala Ser Glu Ala Ser Leu Glu Val Thr 
230 235 ^ 240 

Asp Leu Leu Leu Thr Ala Leu Glu Asn Pro Val Lys Tyr Ala Glu 
245 250 255 

Asp Asn Met Glu Gin Lys Asp Thr Asp Arg He He Cys Ser Thr 
260 *" 265 270 

Asn He Leu His Lys Thr Asp Gin Thr Leu Arg Arg lie Val Ser 
275 280 285 

Gin Thr Met Lys Glu Ala Lys Asp Glu Gin Val Leu Pro His Asn 

_ 29Q ^ . . 295 300 - 

Leu Lys Leu Leu Ala Glu Glu Leu Asn Lys Leu Lys Ala Glu Phe 
3 05 310 315 

Leu Glu Asp Leu Lys Gin Gly Asn Lys Lys Tyr Leu Cys Phe Gin 
320 325 330 

Gin Thr lie Asp He Pro Asp Val He Ser Phe Phe His Tyr Glu 
335 340 345 

Lys Asp Asn lie Val Gin Lys Tyr Phe Ser Lys Gin His 
350 355 

<210> 19 
<211> 302 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1810925CD1 
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<400> 19 



Met 


lie Leu Leu 


Asn 


Asn 


ser his 


1 




5 






Ser 


Leu Ala Arg 


ser 


lie 


Pro Glu 




20 






Val 


Tyr His He 


Asn 


HIS 


Giy Asn 










Val 


Asp Ser Trp 


Pro 


GlU 


Tyr Gin 






50 






Lys 


Gin Glu Met 


Thr 


Asp 


Asp Met 




65 






Arg 


Met Phe Ser 


Lys 


Glu 


Pro Gin 




80 






Asn 


Cys Glu He 


Val 


Asn 


Trp Lys 




95 






Leu 


Gin Glu Ser 


Leu Gly 


Glu Gly 






110 






Lys 


Ser Val Lys 


Val 


Glu 


His Ser 


125 






Glu 


Asp He Leu 


Lys 


Leu 


Asn Ala 




140 




- 


Ser 


Trp Ala Glu 


Thr 


Gly 


His Pro 




155 






Thr 


Pro Asn Phe 


Lys 


Tyr 


Ala Gin 






170 






Leu 


Val Asn Asp 


Asn 


Trp 


Lys Arg 






185 






His 


Tyr lie Lys 


Arg 


Cys 


He Glu 




200 






Leu 


Gly Pro Glu 


Gly Val 


Pro Val 






215 






Ser 


Cys Glu Val 


Gly Met 


Ala Tyr 




230 






Thr 


Gly Asn Met 


Ala 


Arg 


Val Met 




245 






Arg 


Gin Lys Asn 


He 


Pro 


Phe Tyr 




260 






GlU 


Asp Ser Arg 


Arg 


Phe 


Val Gly 




275 






Ser 


Cys Glu Trp 


His 


Gin 


Trp Thr 




290 






Pro 


Phe 









<210> 20 

<211> 234 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1915064CD1 

<400> 20 



Met 


ser 


ser 


Glu 


Val 


Ser 


Ala 


Arg 


1 








5 








Arg 


Ser 


Pro 


Ser 


Gly 


Leu 


Arg 


Met 








20 








Gly 


Ser 


Pro 


Phe 


Gly 


Leu 


Glu 


Glu 








35 








Glu 


Cys 


Arg 


Arg 


Cys 
50 


Met 


Gin 


Cys 


Thr 


Arg 


Lys 


His 


His 


Cys 


Arg 


Arg 








65 








Arg 


cys 


Cys 


Ser 


Gin 
80 


Lys 


Val 


Pro 


Asp 


Pro 


Val 


Arg 


Gin 


Cys 


Ala 


Glu 



Lys 


Leu 


Leu Ala Leu Tyr Lys 


10 






15 


Ser 


Leu 


Lys Val Tyr Gly Ser 




25 






30 


Pro 


Phe 


Asn Met Glu Val Leu 




a n 

40 






45 


Met 


Val 


He 


lie Arg Pro Gin 










60 


Asp 


Ser 


Tyr Thr Asn Val Tyr 


70 






75 


Lys 


Ser 


Glu 


Glu Val Leu Lys 


85 






90 


Gin 


Arg 


Leu 


Gin 


He Gin Gly 




100 






105 


He 


Arg 


Val 


Ala 


Thr Phe Ser 




115 






120 


Arg 


Ala 


Leu 


Leu 


Leu Val Thr 


130 






135 


Ser 


Ser 


Lys 


Ser 


Lys Leu Gly 




145 






150 


Asp 


Asp 


Glu 


Phe 


Glu Ser Glu 


160 






165 


Leu 


Asp 


Val 


Ser 


Tyr Ser Gly 




175 






180 


Gly 


Lys 


Asn 


GlU 


Arg Ser Leu 


190 






195 


Asp 


Leu 


Pro 


Ala 


Ala Cys Met 


205 






210 


Ser 


Trp 


Val 


Thr 


Met Asp Pro 




220 






225 


Ser 


Met 


Glu 


Lys 


Tyr Arg Arg 




235 






240 


Val 


Arg 


Tyr 


Met 


Lys Tyr Leu 




250 






255 


He 


Ser 


Val 


Leu 


Glu Glu Asn 




265 






270 


Gin 


Phe 


Gly 


Phe 


Phe Glu Ala 




280 




285 


Cys 


Tyr 


Pro 


Gin 


Asn Leu Val 


295 






300 









Arg 


Asp 


Ala Lys Lys Leu Val 




10 


15 


Val 


Pro 


Glu His Arg Ala Phe 




25 


30 


Pro 


Gin 


Trp Val Pro Asp Lys 




40 


45 


Asp 


Ala 


Lys Phe Asp Phe Leu 




55 


60 


Cys 


Gly 


Lys Cys Phe Cys Asp 




70 


75 


Leu 


Arg 


Arg Met Cys Phe Val 




85 


90 


Cys 


Ala 


Leu Val Ser Leu Lys 
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95 




. 100 


105 


GlU 


Ala 


GlU 


Phe 


Tyr Asp 


Lys 


Gin Leu Lys Val Leu Leu Ser 


Gly 










110 


115 


120 


Ala 


Thr 


Phe 


Leu 


Val Thr 


Phe 


Gly Asn Ser Glu Lys Pro Glu 


Thr 










125 




130 


135 


Met 


Thr 


Cys 


Arg 


Leu Ser 


Asn 


Asn Gin Arg Tyr Leu Phe Leu 


Asp 






140 




145 


150 


Gly 


Asp 


Ser 


His 


Tyr Glu 


lie 


Glu lie Val His lie Ser Thr 


Val 






155 




160 


165 


Gin 


lie 


Leu 


Thr Glu Gly 


Phe 


Pro Pro Gly Gly Gly Asn Ala 


Arg 










170 




175 


180 


Ala 


Thr 


Gly 


Met 


Phe Leu 


Gin 


Tyr Thr Val Pro Gly Thr Glu 


Gly 








185 




190 


195 


Val 


Thr 


Gin 


Leu 


Lys Leu 


Thr 


Val Val Glu Asp Val Thr Val 


Gly 










200 




205 


210 


Arg 


Arg 


Gin 


Ala 


Val Ala 


Trp 


Leu Val Ala Met His Lys Ala 


Ala 






215 


220 


225 


Lys 


Leu 


Leu 


Tyr 


Glu Ser 


Arg 


Asp Gin 










230 









<210> 21 
<211> 403 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: 2185608CD1 

<400> 21 



Met 


Ala 


Gly 


Ala 


Ala Thr 


Gin 


Ala Ser Leu Glu Ser Ala Pro 


Arg 


1 






5 




10 


15 


He 


Met 


Arg 


Leu 


Val ' Ala 


Glu 


Cys Ser Arg Ser Arg Ala Arg 


Ala 








20 




25 


30 


Gly 


Glu 


Leu 


Trp 


Leu Pro 


His 


Gly Thr Val Ala Thr Pro Val 


Phe 






35 




40 


45 


Met 


Pro 


Val 


Gly 


Thr Gin 


Ala 


Thr Met Lys Gly lie Thr Thr 


GlU 








50 




. 55 


60 


Gin 


Leu 


Asp 


Ala 


Leu Gly 


Cys 


Arg He Cys Leu Gly Asn Thr 


Tyr 








65 


70 


75 


His 


Leu 


Gly 


Leu 


Arg Pro 


Gly 


Pro Glu Leu He Gin Lys Ala 


Asn 








80 


85 


90. 


Gly 


Leu 


His 


Gly 


Phe Met 


Asn 


Trp Pro His Asn Leu Leu Thr 


Asp 






95 




100 


105 


Ser 


Gly 


Gly 


Phe 


Gin Met 


Val 


Ser Leu Val Ser Leu Ser Glu 


Val 






110 




115 


120 


Thr 


Glu 


Glu 


Gly 


Val Arg 


Phe 


Arg Ser Pro Tyr Asp Gly Asn 


GlU 










125 




130 


135 


Thr 


"Leu" 


Leu 


Ser 


Pro Glu 


Lys "Ser Val^Gln lie Gin" Asn Ala 


Leu 










140 „ _ 




145 


150 


Gly 


Ser 


Asp 


lie 


He Met 


Gin 


Leu Asp Asp Val Val Ser Ser 


Thr 








155 




160 


165 


Val 


Thr 


Gly 


Pro 


Arg Val 


Glu 


Glu Ala Met Tyr Arg Ser He 


Arg 








170 




175 


180 


Trp 


Leu 


Asp 


Arg 


Cys lie 


Ala 


Ala His Gin Arg Pro Asp Lys 


Gin 








185 




190 


195 


Asn 


Leu 


Phe 


Ala 


He He 


Gin 


Gly Gly Leu Asp Ala Asp Leu 


Arg 










200 




205 


210 


Ala 


Thr 


Cys 


Leu 


Glu Glu 


Met 


Thr Lys Arg Asp Val Pro Gly 


Phe 








215 




220 


225 


Ala 


He 


Gly 


Gly 


Leu Ser 


Gly 


Gly Glu Ser Lys Ser Gin Phe 


Trp 










230 




235 


240 


Arg 


Met 


Val 


Ala 


Leu Ser 


Thr 


Ser Arg Leu Pro Lys Asp Lys 


Pro 








245 




250 


255 


Arg 


Tyr 


Leu 


Met 


Gly Val 


Gly 


Tyr Ala Thr Asp Leu Val Val 


Cys 










260 




265 


270 


Val 


Ala 


Leu 


Gly 


Cys Asp 


Met 


Phe Asp Cys Val Phe Pro Thr 


Arg 
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275 








280 




285 


Thr 


Ala 


Arg 


Phe Gly 


Ser Ala 


Leu 


Val 


Pro 


Thr Gly 


Asn Leu Gin 








290 








295 




*> nn 


Leu 


Arg 


Lys 


Lys 


Val 


Phe Glu 


Lys Asp 


Phe 


Gly Pro 


He Asp Pro 




305 








310 




-J J.3 


Glu 


Cys 


Thr 


Cys 


Pro 


Thr Cys 


Gin 


Lys 


His 


Ser Arg 


Ala Pne Leu 






320 








325 






His 


Ala 


Leu 


Leu 


His 


ser Asp 


Asn 


Thr 


Ala 


Ala Leu 


His His Leu 










335 






340 




345 


Thr 


Val 


His 


Asn 


He 


Ala Tyr 


Gin 


Leu 


Gin 


Leu Met 


Ser Ala Val 










350 






355 




360 


Arg 


Thr 


Ser 


He 


Val 


Glu Lys 


Arg 


Phe 


Pro 


Asp Phe 


Val Arg Asp 








365 








370 




375 


Phe 


Met 


Gly 


Ala 


Met 


Tyr Gly 


Asp 


Pro 


Thr 


Leu Cys 


Pro Thr Trp 








380 






385 




390 


Ala 


Thr 


Asp 


Ala 


Leu 


Ala ser 


Val 


Gly 


He 


Thr Leu 


Gly 



395 400 

<210> 22 

<211> 487 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2228862CD1 

<400> 22 



Met 


Arg Arg Gly 


Glu 


Arg 


Arg' Asp Ala Gly 


Arg Pro Arg Pro Glu 


1 


5 




10 






15 


Ser 


Pro Val Pro 


Ala 


Gly 


Arg Ala Ser Leu 


Glu 


Glu 


Pro Pro Asp 






20 


z o 






30 


Gly 


Pro Ser Ala 


Gly 


Gin 


Ala Thr Gly Pro 


Gly Glu Gly Arg Arg 




35 




40 






45 


Ser 


Thr Glu Ser 


Glu 


Val 


Tyr Asp Asp Gly 


Thr 


Asn 


Thr Phe Phe 






50 




55 






60 


Trp 


Arg Ala His 


Thr 


Leu 


Thr Val Leu Phe 


He Leu Thr Cys Thr- 


65 




70 






75 


Leu 


Gly Tyr Val 


Thr 


Leu 


Leu Glu Glu Thr 


Pro Gin Asp Thr Ala 




80 




85 






90 


Tyr 


Asn Thr Lys 


Arg 


Gly 


He Val Ala Ser 


He 


Leu 


Val Phe Leu 


95 


100 






105 


Cys 


Phe Gly Val 


Thr 


Gin 


Ala Lys Asp Gly 


Pro 


Phe 


Ser Arg Pro 


110 




115 






120. 


His 


Pro Ala Tyr 


Trp 


Arg 


Phe Trp Leu Cys 


Val 


Ser Val Val Tyr 




125 




130 






135 


Glu 


Leu Phe Leu 


He 


Phe 


He Leu Phe Gin 


Thr Val Gin Asp Gly 




140 




145 






150 


Arg 


Glh Phe "Leu 


Lys Tyr' 


Val" Asp Pro Lys 


Leu' Gly Val "Pro- Leu- 






155 




160 






165 


Pro 


Glu Arg Asp 


Tyr 


Gly 


Gly Asn Cys Leu 


11 e 


Tyr 


Asp Pro Asp 




170 




175 






180 


Asn 


Glu Thr Asp 


Pro 


Phe 


His Asn He Trp 


Asp 


Lys 


Leu Asp Gly 




185 




190 






195 


Phe 


Val Pro Ala 


His 


Phe 


Leu Gly Trp Tyr 


Leu 


Lys 


Thr Leu Met 






200 




205 






210 


He 


Arg Asp Trp 


Trp 


Met 


Cys Met He lie 


Ser 


val 


Met Phe Glu 




215 




220 






225 


Phe 


Leu Glu Tyr 


Ser 


Leu 


Glu His Gin Leu 


Pro 


Asn 


Phe Ser Glu 




230 




235 






240 


Cys 


Trp Trp Asp 


His 


Trp 


He Met Asp Val 


Leu 


Val 


Cys Asn Gly 


245 




250 






255 


Leu 


Gly lie Tyr 


Cys 


Gly 


Met Lys Thr Leu 


Glu 


Trp 


Leu ser Leu 




260 




265 






270 


Lys 


Thr Tyr Lys 


Trp 


Gin 


Gly Leu Trp Asn 


He 


Pro 


Thr Tyr Lys 




275 




280 






285 


Gly 


Lys Met Lys 


Arg 


He 


Ala Phe Gin Phe 


Thr 


Pro 


Tyr Ser Trp 
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290 



Val 


Arg 


Phe Glu 


Trp 


Lys 


Pro 


Ala 






305 








Ala 


Val 


Cys Gly 


He 


lie 


T All 

Leu 


vai 








320 








Thr 


Phe 


Tyr Leu 


Lys 


Phe 


Val 


Leu 








335 








Leu 


Val 


Leu Leu 


Arg 


Leu 


Val 


Pne 








350 








Ala 


Met 


Arg Glu 


He 


Tyr 


Asp 


Phe 








365 








Lys 


Lys 


Leu Gly 


Pro 


Gin 


Ala 


Trp 








380 








Thr 


Glu 


Leu Leu 


He 


Val 


Val 


Lys 








395 








Leu 


Ser 


Leu Pro 


Phe 


Tyr 


He 


Ser 








410 








vai 


Lieu 


Ala Lieu 


Thr 


- 

Trp 


X 11 IT 


val 








425 






He 


Thr 


Leu Arg 


Tyr 


Lys 


Glu 


Thr 








440 








Lys 


Asp 


Asp Gin 


Glv 


Ser 


Thr 


Val 


455 








Leu 


Gly 


Leu Asp 


Glu 


Asp 


Leu 


Leu 








470 








Glu 


Gly 


Ala Pro 


Thr 


Pro 


Asn 





485 

<210> 23 
<211> 246 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 2235577CD1 

<400> 23 



Met Asn Asp 


Met 


Met 
5 


Ser Leu 


Gly 


Leu 


Leu 


Leu 


Trp 


Lys 


Met 


His 


Pro 








20 








Asp Val Ala 


Gly 


Gly 


Thr Gly 


Asp 










35 








Tyr Val Gin 


Ser 


Gin 


His 


Gin 


Arg 










50 








Ala 


Gin 


Gin 


Asn 


Leu 


Ser Trp 


Glu 










65 








-Asn 


Glu-Glu 


Asp- 


Ser 


Leu- Gly 


Gly 










80 






He 


Asn 


Lys 


Glu 


Met 


Leu 


Lys 


Val 










95 








Gin 


Gly 


Tyr 


Arg 


Ala 


Gly 


Leu 


Ala 










110 








Glu 


Leu 


Pro 


Phe 


Asp 


Asp 


Asp 


Lys 










125 








Phe 


Gly 


He 


Arg 


Asn 


Val 


Thr 


His 






140 








Ala 


His 


Arg 


Val 


Leu 


Lys 


Pro 


Gly 










155 








Phe 


ser 


Gin 


Val 


Asn 


Asn 


Pro 


Leu 










170 








Tyr 


Ser 


Phe 


Gin 


Val 


He 


Pro 


Val 








185 








Asp 


Trp 


Lys 


Ser 


Tyr 


Gin 


Tyr 


Leu 










200 








Pro 


Ser 


Gin 


Glu 


Glu 


Phe 


Lys 


Asp 



295 




■a f\ f\ 
JUL) 


Ser Ser 


Leu Arg Arg Trp 


Leu 


310 




J 1 D 


Phe Leu 


Leu Ala Glu Leu 


Asn 


325 




*i t r\ 


Trp Met 


Pro Pro Glu His 


Tyr 


340 




J % D 


Phe Val 


Asn Val Gly Gly 


vai 


355 




JOU 


Met Asp Asp Pro Lys Pro 


HIS 


370 




j / D 


Leu Val 


Ala Ala He Thr 


Ala 


385 




loft 

J y \J 


Tyr Asp 


Pro His Thr Leu 


Thr 


400 




405 


Gin Cys 


Trp Thr Leu Gly 


Ser 


415 




AO f\ 

4Z U 


Trp Arg 


Phe Phe Leu Arg 


Asp 


430 




435 


Arg Trp 


Gin Lys Trp Gin 


Asn 


445 




450 


Glv Asn Glv Asd Gin His 


Pro 


460 




'465 


Gly Pro 


Gly Val Ala Glu 


Gly 


475 




480 



He His Arg Val Trp Lys 


Asp 




10 


15 




Leu Pro Gly Thr Gin Leu 


Leu 




25 


30 




He Ala Phe Arg Phe Leu 


Asn 




40 


45 




Lys Gin Lys Arg Gin Leu 


Arg 




55 


60 




Glu He Ala Lys Glu Tyr 


Gin 




70 


75 




Ser Arg Val Val Val- Cys 


Asp 




85 


90 




Gly Lys Gin Lys "Ala Leu 


Ala 




100 


105 




Trp Val Leu Gly Asp Ala 


Glu 




115 


120 




Phe Asp He Tyr Thr He 


Ala 




130 


135 




He Asp Gin Ala Leu Gin 


Glu 




145 


150 




Gly Arg Phe Leu Cys Leu 


Glu 




160 


165 




He Ser Arg Leu Tyr Asp 


Leu 




175 


180 




Leu Gly Glu Val He Ala 


Gly 




190 


195 




Val Glu Ser He Arg Arg 


Phe 




205 


210 




Met He Glu Asp Ala Gly 


Phe 
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215 220 225 

His Lys Val Thr Tyr Glu Ser Leu Thr Ser Gly He Val Ala He 

230 235 240 

His Ser Gly Phe Lys Leu 

245 

<210> 24 
<211> 410 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2271680CD1 

<400> 24 

Met Trp Ser Gly Arg Lys Leu Gly Ser Ser Gly Gly Trp Phe Leu 
1 5 10 15 

Arg Val Leu Gly Pro Gly Gly Cys Asn Thr Lys Ala Ala Arg Pro 
20 25 30 

Leu He Ser Ser Ala Val Tyr Val Lys Asn Gin Leu Ser Gly Thr 
35 .40 .45 

Leu Gin lie Lys Pro Gly Val Phe Asn Glu Tyr Arg Thr He Trp 
50 55 60 

Phe Lys Ser Tyr Arg Thr lie Phe Ser Cys Leu Asn Arg He Lys 
65 70 75 

Ser Phe Arg Trp Ser Phe Thr Ser Val Ala Gin Ala Gly Val Gin 
80 85 90 

Trp cys Asp Leu Gly Ser Leu Gin Pro Pro Pro Pro Gly Phe Lys 
95 100 105 

Arg Phe Ser Cys Leu Ser Leu Leu Ser His Trp Asp Tyr Arg Tyr 

110 115 120 

Pro Trp Ala Arg Leu Tyr Ser Thr Ser Gin Thr Thr Val Asp Ser 

125 130 135 

Gly Glu Val Lys Thr Phe Leu Ala Leu Ala His Lys Trp Trp Asp 

140 145 150 

Glu Gin Gly Val Tyr Ala Pro Leu His Ser Met Asn Asp Leu Arg 

155 160 165 

Val Pro Phe He Arg Asp Asn Leu Leu Lys Thr He Pro Asn His 

170 175 • 180 

Gin Pro Gly Lys Pro Leu Leu Gly Met Lys He Leu Asp Val Gly 

185 190 195 

Cys Gly Gly Gly Leu Leu Thr Glu Pro Leu Gly Arg Leu Gly Ala 

200 205 210 

Ser Val He Gly He Asp Pro Val Asp Glu Asn He Lys Thr Ala 

215 220 225 

Gin Cys His Lys Ser Phe Asp Pro Val Leu Asp Lys Arg He Glu 

230 235 240 

Tyr Arg Val Cys Ser Leu Glu Glu He Val Glu Glu. Thr Ala Glu 

- ' "245 '"" ' "250 255 

Thr- Phe Asp Ala Val Val Ala Ser Glu Val Val Glu His Val lie 

260 265. 270 

Asp Leu Glu Thr Phe Leu Gin Cys Cys Cys Gin Val Leu Lys Pro 

275 280 285 

Gly Gly Ser Leu Phe He Thr Thr He Asn Lys Thr Gin Leu Ser 

290 295 300 

Tyr Ala Leu Gly He Val Phe Ser Glu Gin He Ala Gly He Val 

305 310 315 

Pro Lys Gly Thr His Thr Trp Glu Lys Phe Val Ser Pro Glu Thr 

320 325 330 

Leu Glu Ser He Leu Glu Ser Asn Gly Leu Ser Val Gin Thr Val 

335 340 345 

Val Gly Met Leu Tyr Asn Pro Phe Ser Gly Tyr Trp His Trp Ser 

350 355 360 

Glu Asn Thr Ser Leu Asn Tyr Ala Ala His Ala Val Lys Ser Arg 

365 370 375 

Val Gin Glu His Pro Ala Ser Ala Glu Phe Val Leu Lys Gly Glu 
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380 385 390 

Thr Glu Glu Leu Gin Ala Asn Ala Cys Thr Asn Pro Ala Val His 

395 400 405 

Glu Lys Leu Lys Lys 

410 

<210> 25 
<211> 253 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2325603CD1 

<400> 25 

Met Asn Phe Ser Gly Gly Gly Arg Gin Glu Ala Ala Gly Ser Arg 
1 5 10 15 

Ser Arg Arg Ala Pro Arg Pro Arg Glu Gin Asp Arg Asp Val Gin 
20 25 30 

Leu Ser Lys Ala Leu Ser Tyr Ala Leu Arg His Gly Ala Leu Lys 
35 40 45 

Leu Gly Leu Pro Met Gly Ala Asp Gly Phe Val Pro Leu Gly Thr 
50 55 60 

Leu Leu Gin Leu Pro Gin Phe Arg Gly Phe Ser Ala Glu Asp Val 
65 70 75 

Gin Arg Val Val Asp Thr Asn Arg Lys Gin Arg Phe Ala Leu Gin 
80 . ~ '" .85 90 

Leu Gly Asp Pro Ser Thr Gly Leu Leu lie Arg Ala Asn Gin Gly 
95 100 105 

His Ser Leu Gin Val Pro Lys Leu Glu . Leu Met Pro Leu Glu Thr 
110 115 120 

Pro Gin Ala Leu Pro Pro Met Leu Val His Gly Thr Phe Trp Lys 
125 130 135 

His Trp Pro Ser lie Leu Leu Lys Gly Leu Ser Cys Gin Gly Arg 
140 145 150 

Thr His He His Leu Ala Pro Gly Leu Pro Gly Asp Pro Gly He 
155 160 165 

He Ser Gly Met Arg Ser His Cys Glu He Ala Val Phe He Asp 
170 " 175 180 

Gly Pro Leu Ala Leu Ala Asp Gly He Pro Phe Phe Arg Ser Ala 
185 190 195 

Asn Gly Val He Leu Thr Pro Gly Asn Thr Asp Gly Phe Leu Leu 
200 205 210 

Pro Lys Tyr Phe Lys Glu Ala Leu Gin Leu Arg Pro Thr Arg Lys 
215 220 225 

Pro Leu ser Leu Ala Gly Asp Glu Glu Thr Glu Cys Gin Ser Ser 
230 235 240 

-Pro Lys His- Ser Ser- Arg Glu- Arg Arg- Arg- lie Gin -Gin - — - — 

245 250 



<210> 26 

<211> 303 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2356055CD1 

<400> 26 

Met Lys Gly Gly Phe Thr Gly Gly Asp Glu Tyr Gin Lys His Phe 
15 10 15 

Leu Pro Arg Asp Tyr Leu-Ala Thr Tyr Tyr Ser Phe Asn Gly Ser 

20 25 30 

Pro Ser Pro Glu Ala Glu Met Leu Lys Phe Asn Leu Glu Cys Leu 

35 40 45 
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His 


Lys 


Thr 


Phe 


Gly Pro 


Gly 


Gly Leu Gin Gly Asp Thr Leu He 








50 






re cn 


Asp 


lie 


Gly 


Ser 


Gly Pro 


Thr 


He Tyr Gin Val Leu Ala Ala cys 






65 






/ U /J 


Asp 


ser 


Phe 


Gin 


Asp 


He 


Thr 


Leu Ser Asp Pne Tnr Asp Arg Asn 








80 








Arg 


GlU 


Glu 


Leu 


Glu 


Lys 


Trp 


Leu Lys Lys giu pro laiy Aia lyr 








95 




100 105 


Asp 


Trp 


Thr 


Pro 


Ala 


Val 


Lys 


Pne Ala cys Glu Leu uiu t^±y Asn 






110 






iic ion 

JLJ.D 1ZU 


Ser 


Gly 


Arg 


Trp 


Glu 


Glu 


Lys 


Glu Glu Lys Leu Arg Ala Ala Val 




125 








Lys 


Arg 


Val 


Leu 


Lys 


Cys 


Asp 


Val His Leu Gly Asn Pro Leu Ala 






140 








Pro 


Ala 


Val 


Leu 


Pro 
155 


Leu 


Ala 


Asp Cys Val Leu Thr Leu Leu Ala 

1 cr\ ICC 

160 lob 


Met 


Glu 


Cys 


Ala 


Cys 


Cys 


Ser 


Leu Asp Ala Tyr Arg Ala Ala Leu 








170 






inc. ion 
1 /b loU 


Cys 


Asn 


Leu 


Ala 


Ser 


Leu 


Leu 


Lys Pro Gly Gly His Leu Val Thr 








185 






190 195 


Thr 


Val 


Thr 


Leu Arg 


Leu 


Pro 


Ser Tyr Val Val Gly Lys Arg Glu 










200 






1 ft C *51 A 


Phe 


ser 


Cys 


Val 


Ala 


Leu 


Glu 


Lys Glu Glu Val Ala Ala Arg Gin 








215 






220 22b 


Cys 


Pro 


Gly 


Glu 


GlU 


lie 


Ala 


Lys Glu Arg Arg Leu Gin Met Pro 






230 






--, -> r- ft yi ft 

235 ^40 


Pro 


Pro 


Cys 


Asp Val Arg 


Thr 


Ser Leu Ser Glu Arg Ser Gly Gin 








245 








Asp 


Thr 


Gly 


Lys 


Arg 


His 


Arg 


He Gin Thr Arg Gly Ser Ala Pro 






260 






265 270 


Trp 


Thr 


Ala 


Gin 


Cys 


Arg 


Glu 


Ser Ala Gly Cys Leu Glu Gly Glu 








275 




280 285 


Ser 


Arg 


Gin 


Gly Cys 


Glu 


Gly 


He Phe Gly Cys Cys Gly Ser Cys 








290 






295 300 


Ser 


Thr 


Leu 













<210> 27 

<211> 307 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2448909CD1 

<400> 27 

Met Gin Lys Gly Lys Gly Arg Thr Ser Arg He Arg Arg Arg Lys 

— 1 . - 5 - 10 IS— ------- 

Leu Cys Gly Ser Ser Glu Ser Arg Gly Val Asn Glu Ser His Lys 

20 25 30 

Ser Glu Phe He Glu Leu Arg Lys Trp Leu Lys Ala Arg Lys Phe 

35 40 45 

Gin Asp Ser Asn Leu Ala Pro Ala Cys Phe Pro Gly Thr Gly Arg 

50 55 h 60 

Gly Leu Met Ser Gin Thr Ser Leu Glh Glu Gly Gin Met He He 

65 70 75 

Ser Leu Pro Glu Ser Cys Leu Leu Thr Thr Asp Thr Val He Arg 

80 85 90 

Ser Tyr Leu Gly Ala Tyr He Thr Lys Trp Lys Pro Pro Pro Ser 

95 100 105 

Pro Leu Leu Ala Leu Cys Thr Phe Leu Val Ser Glu Lys His Ala 
110 115 120 

Gly His Arg Ser Leu Trp Lys Pro Tyr Leu Glu lie Leu Pro Lys 
125 130 135 

Ala Tyr Thr Cys Pro Val cys Leu Glu Pro Glu Val Val Asn Leu 
140 145 150 
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Leu 


Pro 


Lys Ser 


Leu 


T 71 1 T ire 
LtyS Ala uyS 


71 1 a pin pi,, ply. 7v y-r* 71 1 a 

nia ulu blu blli ATy nla 


HIS 






ICC 


1 fin 


1 O J 


vai 


bin 


V»1U rfie 


T)V,A 

rfle 


Aia ber oer 


Aiy Asp i'lie xriie ser oe* 


Leu 








x / u 






i fln 
xou 


Gin 


Pro 


Leu fne 


% 1 a 


vjiu Aia vai 


TV en Cor Tl « DVtA Coy* r P\r>" 

Asp oer lie fne oer xyx 


Ser 








1 QC 




1 on 
x ? u 


X 3 D 


Ala 


Leu 


L»eU i rp 


Aia 


i rp t~ys i nr 


vai hsn inr Arg Ala vai 


Tyr 






z u u 


A w 3 




Leu 


ATy 


Fro Arg 


t»in 


Arg biu v-ys 


ueu oer Aia wiu irirv Asp 


1 Jli 






^1 c 




??n 




cys 


Ala 


Leu Ala 


Pro 


Tyr Leu Asp 


lieu lieu Asn his ser irro 


HIS 






Z J u 




9 An 


val 


Gin 


Val Lys 


Ala 


Aia pne Asn 


bm biu inr his oer ryr 


Oil! 

LilU 






245 




250 


255 


He 


Arg 


Thr Thr 


Ser 


Arg Trp Arg 


Lys His Glu Glu Val Phe 


He 






260 


265 


270 


Cys 


Tyr 


Gly Pro 


His 


Asp Asn Gin 


Arg Leu Phe Leu Glu Tyr 


Gly 


275 




280 


285 


Phe 


Val 


Ser Val 


His 


Asn Pro His 


Ala Cys Val Tyr Val Ser 


Arg 








290 




295 


300 


Gly 


Trp 


Asn Gin 


Leu 


Cys Ser 







"305 

<210> 28 
<211> 169 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature* 

<223> Incyte ID No: 2631212CD1 

<400> 28 



Met 


Lys 


Gly 


Ser 


Arg 


He 


Glu 


Leu 


Gly Asp Val Thr Pro His 


Asn 


1 








5 








10 


15 


He 


Lys 


Gin. 


Leu 


Lys 


Arg 


Leu 


Asn 


Gin Val He Phe Pro Val 


Ser 








20 








25 


30 


Tyr 


Asn 


Asp 


Lys 


Phe 


Tyr 


Lys 


Asp 


Val Leu Glu Val Gly Glu 


Leu 










35 








,40 


45 


Ala 


Lys 


Leu 


Ala 


Tyr 


Phe 


Asn 


Asp 


He Ala Val Gly Ala Val 


Cys 










50 








55 


60 


Cys 


Arg 


Val 


Asp 


His 


Ser 


Gin 


Asn 


Gin Lys Arg Leu Tyr He 


Met 










65 








70 


75 


Thr 


Leu 


Gly 


Cys 


Leu 


Ala 


Pro 


Tyr 


Arg Arg Leu Gly lie Gly 


Thr 










80 








85 


90 


Lys 


Met 


Leu 


Asn 


His 


Val 


Leu 


Asn 


He Cys Glu Lys Asp Gly 


Thr 








95 








100 


105 


Phe 


Asp 


Asn 


He 


Tyr 


Leu 


His 


Val 


Gin He Ser Asn Glu Ser 


Ala 








.11.0. 








. ... 115. 


120 


"lie 


Asp 


Phe 


Tyr Arg 


Lys 


Phe 


Gly 


Phe Glu He He Glu Thr 


Lys 










125 








130 


135 


Lys 


Asn 


Tyr 


Tyr 


Lys 


Arg 


lie 


Glu 


Pro Ala Asp Ala His Val 


Leu 






140 








145 


150 


Gin 


Lys 


Asn 


Leu Lys Val 


Pro 


Ser 


Gly Gin Asn Ala Asp Val 


Gin 








155 








160 


165 


Lys 


Thr 


Asp 


Asn 















<210> 29 
<211> 389 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2678733CD1 
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<400> 29. 










Met 


Arg 


val 


Leu 


Val Arg Arg cys 


Trp 


1 








5 






nil. 

Giy 


Aia 


Arg 


Arg 


Gly Arg Pro 


ser 


Pro 








20 






Arg 


Leu 


Gly 


Trp 


Glu Asp Cys Arg 


ASp 










35 






Fro 


Fro 


Trp 


Arg 


Val Leu Phe 


Phe 


Gly 










50 






Gill 


Aia 


Leu 


Arg 


Ala Leu His Ala 


Ala 


bill 






65 






Leu 


lie 


Asp 


Lys Leu Glu 


val 


vai 








80 






Pro 


Lys 


Gly 


Leu 


Pro Val Lys 

QC 


Gin 


Tyr 


Pro 


Val 


Tyr 


Glu 


Trp Pro Asp Val 


Gly 










110 






Gly 


Val 


Val 


Ala 


Ser Phe. Gly Arg 


Leu 










125 






Leu 


Lys 


Phe 


Pro 


Tyr Gly He 


Leu 


Asn 










140 






Pro 


Arg 


Trp 


Arg 


Gly Pro Ala. 


Pro 


Val 










155 






Gly 


Asp 


Thr 


Val 


Thr Gly Val 


Thr 


He 










170 






Arg 


Phe 


Asp 


Val 


Gly Pro He 


Leu 


Lys 










185 






Pro 


Pro 


Lys 


Ser 


Thr Ala Lys 


Glu 


Leu 










200 






Leu 


Gly 


Ala 


Asn 


Met Leu He 


Ser 


Val 








215 






Ser 


Leu 


Ser 


Asn 


Gly Arg Gin Gin 


Pro 










230 






Ala 


Pro 


Lys 


He 


Ser Ala Gly Thr 


Ser 








245 






Gin 


Thr 


Ser 


Glu Gin lie Phe 


Arg 


Leu 










260 






lie 


lie 


Pro 


Leu 


Gin Thr Leu 


Trp 


Met 










275 




Leu 


Asp 


Leu 


Val 


Glu Val Asn 


Ser 


Ser 








290 






Leu 


Thr 


Gly 


Gin 


Ala Leu He 


Pro 


Gly 










305 








Ser 


r» In 
bin 


He 


Leu Leu Val 


Tyr 


Cys 










320 






Val 


Arg 


Ser 


Val 


Met Leu Lys 


Lys 


Ser 










335 






Tyr 


Asn 


Gly 


Tyr Leu His Pro 
- 350- - — 


Trp 


Tyr 


Gin 


Pro 


Ser 


Gin Cys Arg Phe 


Gin 


Thr 










365 






Lys 


Lys 


Gin 


Lys 


Lys Thr Val 


Ala 


Met 



380 

<210> 30 

<211> 600 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> raisc_feature 

<223> Incyte ID No: 2768571CD1 

<400> 30 

Met Arg Ser Cys Leu Trp Arg Cys Arg 

1 5 
Gin Trp Ser Leu Leu Leu Ala Val Leu 
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Giy Fro Fro 


Leu 


Ala 


Hi e 
nis 


1 o 






ID 


bin irp Arg 


Aia 


Leu 


Ala 
Ala 


ZD 








ser Arg vai 


Arg 


Pin 


Lys 


*± u 








J.I1X. nap vlli 


riic 


Ala 


ai y 


cc 






fin 


TV "V/T f 111 Ann 
AlCJ blU Abll 


Jjy fc> 


vjl IX 


f2l 11 


1 w 






/ j 


inr Met Fro 


Car 
Dei 


Pro 












Aia vai bin 


Car 

■ser 


bin 


Leu 


1 u u 






1UD 


Clew pl i/ f^Tii 

oer biy uiu 


Tyr 


Asp 




lib 






ion 


Leu Asn Glu 


Aia 


Leu 


ne 


t in 
1 J u 






1 J 5 


vai his Fro 


ser 


Cys 


Leu 


1 A C 

14b 




1 cn 

15 0 


lie his xnr 


vai 


Leu 


HIS 


160 






165 


Met Gin lie 


Arg 


Pro 


Lys 


175 






180 


Gin giu inr 


vai 


Pro 


vai 


190 






195 


Glu Aia vai 


Leu 


Ser 


Arg 


205 






210 


Leu Lys Asn 


Leu 


Pro 


Glu 


220 






o o c 

225 


Met Glu Gly 


Ala 


Thr 


Tyr 


235 






240 


Cys lie Lys 


Trp 


G1U 


Pin 
G1U 


OCA 

250 






OTP 

255 


Tyr Arg Ala 


Ti- 
ll^ 


Giy 


Asn 


265 






OTA 

270 


Ala Asn Thr 


T 1 A 

ne 


.Lys 


T All 

Leu 


1 O A 

280 






2o5 


val Leu Ala 


ASp 


Pro 


Lys 


o o c 
295 






J uo 


ser vai lie 


Tyr 


HIS 


Lys 


310 






"5 1 C 

J15 


Lys Asp G ly 


Trp 


Ti- 
ne 


Giy 


325 






330 


Leu Thr Ala 


Thr 


Asp 


Phe 


340 






345 


Gin Lys Asn 


Ser 


Gin 


Ala 


-355 






360 


Leu Arg Leu 


Pro 


Thr 


Lys 


370 






375 


Gin Gin Cys 


lie 


Glu 




385 








His Leu Ser 


Gin 


Gly 


Val 


10 






15 


Val Phe Phe 


Leu 


Phe 


Ala 
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20 25 30 

Leu Pro ser Phe lie Lys Glu Pro Gin Thr Lys Pro Ser Arg His 
35 40 45 

Gin Arg Thr Glu Ash lie Lys Glu Arg Ser Leu Gin Ser Leu Ala 
50 55 60 

Lys Pro Lys Ser Gin Ala Pro Thr Arg Ala Arg Arg Thr Thr He 
65 70 75 

Tyr Ala Glu Pro Val Pro Glu Asn Asn Ala Leu Asn Thr Gin Thr 
80 85 90 

Gin Pro Lys Ala His Thr Thr Gly Asp Arg Gly Lys Glu Ala Asn 
95 100 105 

Gin Ala Pro Pro Glu Glu Gin Asp Lys Val Pro His Thr Ala Gin 

110 115 120 

Arq Ala Ala Trp Lys Ser Pro Glu Lys Glu Lys Thr Met Val Asn 

125 130 135 

Thr Leu ser Pro Arg Gly Gin Asp Ala Gly Met Ala Ser Gly Arg 

140 145 150 

Thr Glu Ala Gin Ser Trp Lys Ser Gin Asp Thr Lys Thr Thr Gin 

155 160 165 

Gly Asn Gly Gly Gin Thr Arg Lys Leu Thr Ala Ser Arg Thr Val 

170 ~ 175 180 

Ser Glu Lys His Gin Gly Lys Ala Ala Thr Thr Ala Lys Thr Leu 

185 190 195 

He Pro Lys Ser Gin His Arg Met Leu Ala Pro Thr Gly Ala Val 
' 200 205 210 

Ser Thr Arg Thr Arg Gin Lys Gly Val Thr Thr Ala Val lie Pro 

215 220 225 

Pro Lys Glu Lys Lys Pro Gin Ala Thr Pro Pro Pro Ala Pro Phe 

230 235 ■ 240 

Gin Ser Pro Thr Thr Gin Arg Asn Gin Arg Leu Lys Ala Ala Asn 

245 250 255 

Phe Lys Ser Glu Pro Arg Trp Asp Phe Glu Glu Lys Tyr Ser Phe 

260 265 270 

Glu He Gly Gly Leu Gin Thr Thr cys Pro Asp Ser Val Lys He 

275 280 285 

Lys Ala Ser Lys Ser Leu Trp Leu Gin Lys Leu Phe Leu Pro Asn 

290 295 300 

Leu Thr Leu Phe Leu Asp Ser Arg His Phe Asn Gin Ser Glu Trp 

305 . 310 315 

Asp Arg Leu Glu His Phe Ala Pro Pro Phe Gly Phe Met Glu Leu 

320 325 330 

Asn Tyr Ser Leu Val Gin Lys val Val Thr Arg Phe Pro Pro Val 

335 340 345 

Pro Gin Gin Gin Leu Leu Leu Ala Ser Leu Pro Ala Gly Ser Leu 

350 355 360 

Arg Cys lie Thr Cys Ala Val Val Gly Asn Gly Gly lie Leu Asn 

365 370 375 

Asn Ser His Met Gly Gin Glu He Asp Ser His Asp Tyr Val Phe 

— * ' "~380 - " 385 "390 

Arg Leu Ser Gly Ala Leu lie Lys Gly Tyr Glu Gin Asp Val Gly 

395 400 405 

Thr Arg Thr Ser Phe Tyr Gly Phe Thr Ala Phe Ser Leu Thr Gin 

410 415 420 

Ser Leu Leu lie Leu Gly Asn Arg Gly Phe Lys Asn Val Pro Leu 

... , 425 430 „ . . 435 

Gly Lys Asp Val Arg Tyr Leu His Phe Leu Glu Gly Thr Arg Asp 

440 445 450 

Tyr Glu Trp Leu Glu Ala Leu Leu Met Asn Gin Thr Val Met Ser 

455 460 465 

Lys Asn Leu Phe Trp Phe Arg His Arg Pro Gin Glu Ala Phe Arg 

470 475 .480 

Glu Ala Leu His Met Asp Arg Tyr Leu Leu Leu His Pro Asp Phe 

485 490 495 

Leu Arg Tyr Met Lys Asn Arg Phe Leu Arg Ser Lys Thr Leu Asp 

500 505 510 

Gly Ala His Trp Arg He Tyr Arg Pro Thr Thr Gly Ala Leu Leu 

515 520 525 
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Leu Leu Thr Ala 


Leu 


Gin 


Leu 


Cys Asp 


Gin 


Val 


Ser 


Ala Tyr Gly 








530 
















Phe lie 


Thr 


Glu 


Gly 


His 


Glu Arg Phe 


Ser 


Asp 


His 


Tyr Tyr Asp 








545 








CCA 






ccc 


Thr Ser Trp Lys 


Arg 


Leu 


He 


Phe Tyr 


lie 


Asn 


His 


Asp Phe Lys 








560 








C £ C 






D / U 


Leu Glu Arg Glu 


Val 


Trp 


Lys Arg Leu 


HIS 


Asp 




«iy lie lie 








rnr 

575 








co n 

DO U 








Arg Leu Tyr Gin 


Arg 


Pro 


Gly Pro Gly 


Thr 


Ala 


Lys 


Ala Lys Asn 








590 








595 






600 


<210> 31 




















<211> 448 




















<212> PRT 




















<213> Homo sapiens 
















<220> 






















<221> misc feature 
















<223> Incyte ID 


No: 


3189062CD1 










<400> 31 




















Met Arg 


Glu 


Asn 


Val 


Val 


Val 


Ser Asn 


Met 


Glu 


Arg 


Glu Ser Gly 


1 






5 








10 






15 


Lys Pro 


Val 


Ala 


Val 


Val 


Ala 


Val Val 


Thr 


Glu 


Pro 


Trp Phe Thr 






20 








25 






30 


Gin Arg 


Tyr 


Arg Glu 


Tyr 


Leu 


Gin Arg 


Gin 


Lys 


Leu 


Phe, Asp Thr 




35 








40 






45 


Gin His 


Arg 


Val 


Glu 


Lys 


Met 


Pro Asp 


Gly 


Ser 


Val 


Ala Leu Pro 






50 






55 






60 


Val Leu 


Gly 


Glu 


Thr 


Leu 


Pro 


Glu Gin 


His 


Leu 


Gin 


Glu Leu Arg 






65 








70 






75 


Asn Arg 


Val 


Ala 


Pro 


Gly. Ser 


Pro Cys 


Met 


Leu 


Thr 


Gin Leu Pro 






80 








85 






90 


Asp Pro 


Val 


Pro 


Ser 


Lys 


Arg 


Ala Gin 


Gly 


Cys 


Ser 


Pro Ala Gin 






95 






100 






105 


Lys Leu 


Cys 


Leu 


Glu 


Val 


Ser 


Arg Trp 


Val 


Val 


Gly 


Arg Gly Val 




110 








115 






120 


Lys Trp 


Ser 


Ala 


Glu 


Leu 


Glu 


Ala Asp 


Leu 


Pro 


Arg 


Ser Trp Gin 






125 








13 0 






135 


Arg His 


Gly 


Asn 


Leu 


Leu 


Leu 


Leu Ser 


Glu 


Asp 


Cys 


Phe Gin Ala 




140 








145 






150 


Lys Gin 


Trp 


Lys 


Asn 


Leu Gly 


Pro Glu 


Leu 


Trp 


GlU 


Thr Val Ala 




155 








160 






165 


Leu Ala 


Leu 


Gly Val 


Gin Arg 


Leu Ala 


Lys 


Arg 


Gly 


Arg Val Ser 








170 








17 5 






180 


Pro Asp 


Gly 


Thr 


Arg 


Thr 


Pro 


Ala Val 


Thr 


Leu 


Leu 


Leu Gly Asp 




185 








190 






IOC 

195 


His" Gly Trp 


Val 


Glu 


'His 


Val 


Asp Asn Gly 


lie 


Arg 


Tyr "Lys Phe 








200 








205 






210 


Asp Val 


Thr 


Gin 


Cys 


Met 


Phe 


Ser Phe Gly 


Asn 


He 


Thr Glu Lys 






215 








220 






<-j o c 

225 


Leu Arg 


Val 


Ala 


Ser 


Leu 


Ser 


cys Ala Gly 


Glu 


Val 


Leu Val Asp 






230 








23 5 








Leu Tyr 


Ala 


Gly 


He 


Gly 


Tyr 


Phe Thr 


Leu 


Pro 


Phe 


Leu val His 




245 








250 






Zoo 


Ala Gly 


Ala 


Ala 


Phe 


Val 


His 


Ala Cys Glu 


Trp 


Asn 


Pro His Ala 






260 








265 






c, 1 u 


Val Val 


Ala 


Leu 


Arg 


Asn 


Asn 


Leu Glu 


He 


Asn 


Gly 


Val Ala Asp 






275 








280 






285 


Arg cys 


Gin 


He 


His 


Phe 


Gly 


Asp Asn Arg 


Lys 


Leu 


Lys Leu Ser 






290 








295 






300 


Asn lie 


Ala 


Asp 


Arg 


Val 


He 


Leu Gly Leu 


He 


Pro 


Ser Ser Glu 






305 








310 






315 


Glu Gly 


Trp 


Pro 


He 


Ala 


Cys 


Gin Val 


Leu 


Arg 


Gin 


Asp Ala Gly 




320 








325 






330 


Gly He 


Leu 


His 


He 


His 


Gin 


Asn Val 


Glu 


Ser 


Phe 


Pro Gly Lys 
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335 340 345 

Asn Leu Gin Ala Leu Gly Val Ser Lys Val Glu Lys Glu His Trp 

350 355 360 

Leu Tyr Pro Gin Gin He Thr Thr Asn Gin Trp Lys Asn Gly Ala 

365 370 375 

Thr Arg Asp Ser Arg Gly Lys Met Leu Ser Pro Ala Thr Lys Pro 

380 385 390 

Glu Trp Gin Arg Trp Ala Glu Ser Ala Glu Thr Arg He Ala Thr 

395 400 405 

Leu Leu Gin Gin Val His Gly Lys Pro Trp Lys Thr Gin He Leu 

410 415 420 

His He Gin Pro Val Lys Ser Tyr Ala Pro His Val Asp His He 

425 430 435 
Val Leu Asp Leu Glu Cys Cys Pro Cys Pro Ser Val Gly 

440 445 

<210> 32 
<211> 346 
<212> PRT 

<213> Homo sapiens 
<220> . 

<221> misc_feature 

<223> Incyte ID No: 3243884CD1 

<400> 32 

Met Ala Ala ser Gly Lys Leu Ser Thr Cys Arg Leu Pro Pro Leu 

1 5 10 15 

Pro Thr He Arg Glu lie He Lys Leu Leu Arg Leu Gin Ala Ala 

20 25 30 

Lys Gin Leu Ser Gin Asn Phe Leu Leu Asp Leu Arg Leu Thr Asp 

35 40 45 

Lys He Val Arg Lys Ala Gly Asn Leu Thr Asn Ala Tyr Val Tyr 

50 55 60 

Glu Val Gly Pro Gly Pro Gly Gly He Thr Arg Ser He Leu Asn 

65 70 75 

Ala Asp Val Ala Glu Leu Leu Val Val Glu Lys Asp Thr Arg Phe 

80 85 90 

He Pro Gly Leu Gin Met Leu ser Asp Ala Ala Pro Gly Lys Leu 

95 100 105 

Arg He Val His Gly Asp Val Leu Thr Phe Lys Val Glu Lys Ala 

110 115 120 

Phe Ser Glu Ser Leu Lys Arg Pro Trp Glu Asp Asp Pro Pro Asn 

125 130 135 

Val His He lie Gly Asn Leu Pro Phe Ser Val Ser Thr Pro Leu 

140 145 150 

He He Lys Trp Leu Glu Asn He Ser Cys Arg Asp Gly Pro Phe 

155 160 165 

- Val- Tyr Gly -Arg Thr- Gln~ Met Thr Leu Thr Phe -Gin Lys Glu- Val 

170 175 _ 180 

Ala Glu Arg Leu Ala Ala Asn Thr Gly Ser Lys Gin Arg Ser Arg 

185 190 195 

Leu Ser Val Met Ala Gin Tyr Leu Cys Asn Val Arg His He Phe 

200 205 210 

Thr He Pro Gly Gin Ala Phe Val Pro Lys Pro Glu Val Asp Val 

215 220 225 

Gly Val Val His Phe Thr Pro Leu He Gin Pro Lys He Glu Gin 

230 235 240 

Pro Phe Lys Leu Val Glu Lys Val Val Gin Asn Val Phe Gin Phe 

245 250 255 

Arg Arg Lys Tyr Cys His Arg Gly Leu Arg Met Leu Phe Pro Glu 

260 265 270 

Ala Gin Arg Leu Glu Ser Thr Gly Arg Leu Leu Glu Leu Ala Asp 

275 280 285 

He Asp Pro Thr Leu Arg Pro Arg Gin Leu Ser lie Ser His Phe 

290 295 300 

Lys Ser Leu Cys Asp Val Tyr Arg Lys Met Cys Asp Glu Asp Pro 
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305 310 315 

Gin Leu Phe Ala Tyr Asn Phe Arg Glu Glu Leu Lys Arg Arg Lys 

320 325 330 

Ser Lys Asn Glu Glu Lys Glu Glu Asp Asp Ala Glu Asn Tyr Arg 

335 340 345 

Leu 



<210> 33 

<211> 173 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID NO: 3400578CD1 

j 

<400> 33 

Met Ala Ser He Leu Arg Thr Pro Gin Ala Leu Gin Leu Thr Leu 
1 5 10 15 

Ala Leu He Lys Pro Asp Ala Val Ala His Pro Leu lie Leu Glu 
20 25 30 

Ala Val His Gin Gin lie Leu Ser Asn Lys Phe Leu lie Val Arg 
35 40 45 

Met Arg Glu Leu Leu Trp Arg Lys Glu Asp Cys Gin Arg Phe Tyr 
50 ' 55 60 

Arg Glu His Glu Ala Gly Pro He Arg Ala Tyr He Leu Ala His 
65 70 75 

Lys Asp Ala He Gin Leu Trp Arg Thr Leu Met Gly Pro Thr Arg 
80 " 85 90 

Val Phe Arg Ala Arg His Val Ala Pro Asp Ser lie Arg Gly ser 
95 100 105 

Phe Gly Leu Thr Asp Thr Arg Asn Thr Thr His Gly Ser Asp Ser 
110 " 115 120 

Val Val Ser Ala Ser Arg Glu He Ala Ala Phe Phe Pro Asp Phe 
•125 " 13 0 135 

Ser Glu Gin Arg Trp Tyr Glu Glu Glu Glu Pro Gin Leu Arg Cys 
140 145 150 

Gly Pro Val Cys Tyr Ser Pro Glu Gly Gly Val His Tyr Val Ala 
155 160 165 

Gly Thr Gly Gly Leu Gly Pro Ala 
170 

<210> 34 

<211> 445 

<212> PRT 

<213> Homo sapiens 



<220> " " " " 

<221> misc_feature . . . 

<223> Incyte ID No: 3422577CD1 

<400> 34 

Met Thr Glu Leu Arg Gin Arg Val Ala His Glu Pro Val Ala Pro 

1 -5 - - . . . 10 15 

Pro Glu Asp Lys Glu Ser Glu Ser Glu Ala Lys Val Asp Gly Glu 

20 25 30 

Thr Ala Ser Asp Ser Glu Ser Arg Ala Glu Ser Ala Pro Leu Pro 

35 40 45 

Val Ser Ala Asp Asp Thr Pro Glu Val Leu Asn Arg Ala Leu Ser 

50 55 60 

Asn Leu Ser Ser Arg Trp Lys Asn Trp Trp Val Arg Gly He Leu 

65 70 75 

Thr Leu Ala Met lie Ala Phe Phe Phe He lie He Tyr Leu Gly 

80 85 90 

pro Met Val Leu Met He He Val Met cys Val Gin lie Lys Cys 

95 100 105 
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Phe 


His 


Glu lie 


lie 


Thr 


He 


Gly 








110 








Asp 


Leu 


Pro Trp 


Phe 


Arg 


Thr 


Leu 






125 








Val 


Asn 


Tyr Phe 


Phe 


Tyr 


Gly 


Glu 






140 








Thr 


Leu 


Val Gin 


Arg 


Glu 


Glu 


Pro 








155 








His 


Arg 


Phe lie 


Ser 


Phe 


Thr 


Leu 






170 








Phe 


Val 


Leu ser 


Leu 


Val 


Lys 


Lys 








185 








Met 


Phe 


Gly Trp 


Thr 


His 


Val 


Thr 






200 








Ser 


His 


Leu Val 


lie 


His 


Asn 


Leu 








215 








He 


Val 


Pro He 


Ser 


Cys 


Val 


He 








23 0 








Met 


Phe 


Gly Phe 


Phe 


Phe 


Gly 


Arg 






245 








Pro 


Lys 


Lys Thr 


Trp 


Glu 


Gly 


Phe 








260 








Val 


Val 


Phe Gly 


Leu 


Leu 


Leu 


Ser 








275 








cys 


Phe 


Val Cys 


Pro 


Val 


Glu 


Tyr 






290 








Thr 


Val 


Asp Cys 


Glu 


Pro 


Ser 


Asp 






305 








Asn 


lie 


Pro Gly 


Val 


He 


Gin 


Ser 








320 








Arg 


Met 


Tyr Pro 


Phe 


Gin 


He 


His 






335 








Ala 


Ser 


Leu He 


Gly 


Pro 


Phe 


Gly 








350 








Lys 


Arg 


Ala Phe 


Lys 


He 


Lys 


Asp 








365 








His 


Gly 


Gly He 


Met 


Asp 


Arg 


Phe 






380 








Thr 


Phe 


Val Asn 


Val 


Tyr 


He 


Ala 








395 






Pro 


Ser 


Lys Leu 


He 


Gin 


Gin 


Phe 






410 








Gin 


Leu 


His He 


Phe 


Asn 


Thr 


Leu 








425 








Gly 


Met 


Leu Thr 


Ser 


Thr 


Thr 


Glu 



440 

<210> 35 
<211> 420 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3706809CD1 

<400> 35 

Met Ala Ala Leu Val Arg Pro Ala 

1 5 
Leu Gin Val Val Gin Ala Trp Asp 
20 

Arg Ala Leu Arg Arg Ser Pro Val 
35 

Glu Val Val Glu Gin Lys Arg Ala 
50 

Ala Pro Ser Glu Ala Ser Ala Gin 
65 



Tyr Asn 


Val 


Tyr 


His Ser 


Tyr 


115 








120 


Ser Trp 


Tyr 


Phe 


Leu Leu 


Cys 


130 








135 


Thr Val 


Thr 


Asp 


Tyr Phe 


Phe 


145 






150 


Leu Arg 


He 


Leu 


Ser Lys 


Tyr 


160 








165 


Tyr Leu 


He 


Gly 


Phe Cys 


Met 


175 








180 


His Tyr 


Arg 


Leu 


Gin Phe 


Tyr 


190 








195 


Leu Leu 


lie 


Val 


Val Thr 


Gin 


205 








210 


Phe Glu 


Gly 


Met 


He Trp 


Phe 


220 






225 


Cys Ash 


Asp 


He 


Met Ala 


Tyr 


235 








240 


Thr Pro 


Leu 


He 


Lys Leu 


Ser 


250 






255 


He Gly 


Gly 


Phe 


Phe Ala 


Thr 


265 






270 


Tyr Val 


Met 


Ser 


Gly Tyr 


Arg 


280 








285 


Asn Asn 


Asp 


Thr 


Asn Ser 


Phe 


295 






300 


Leu Phe 


Arg 


Leu 


Gin Glu 


Tyr 


310 








315 


Val He 


Gly 


Trp 


Lys Thr 


Val 


325 








330 


Ser He 


Ala 


Leu 


Ser Thr 


Phe 


340 








345 


Gly Phe 


Phe 


Ala 


Ser Gly 


Phe 


355 








3 60 


Phe Ala 


Asn 


Thr 


He Pro 


Gly 


370 








375 


Asp Cys 


Gin 


Tyr 


Leu Met 


Ala 


385 








390 


Ser Phe 


He 


Arg 


Gly Pro 


Asn 


400 




405 


Leu Thr 


Leu 


Arg 


Pro Asp 


Gin 


415 








420 


Arg Ser 


His 


Leu 


He Asp 


Lys 


430 








435 


Asp Glu 










445 












Arg Phe 


Val 


Val 


Arg Pro 


Leu 


10 








15 


Leu Asp 


Ala 


Arg 


Arg Trp 


Val 


25 








30 


Lys Val 


Val 


Phe 


Pro Ser 


Gly 


40 








45 


Pro Gly 


Lys 


Gin 


Pro Arg 


Lys 


55 








60 


Glu Gin 


Arg 


Glu 


Lys Gin 


Pro 



70 75 
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Leu Glu Glu Ser Ala Ser Arg Ala Pro Ser Thr Trp Glu Glu Ser 
80 85 90 

Gly Leu Arg Tyr Asp Lys Ala Tyr Pro Gly Asp Arg Arg Leu Ser 
95 100 105 

Ser Val Met Thr He Val Lys Ser Arg Pro Phe Arg Glu Lys Gin 

110 115 120 

Gly Lys He Leu Leu Glu Gly Arg Arg Leu He Ser Asp Ala Leu 

125 130 135 

Lys Ala Gly Ala Val Pro Lys Met Phe Phe Phe Ser Arg Leu Glu 

140 145 150 

Tyr Leu Lys Glu Leu Pro Val Asp Lys Leu Lys Gly Val ser Leu 

155 160 165 

He Lys Val Lys Phe Glu Asp He Lys Asp Trp Ser Asp Leu Val 

170 ~ 175 180 

Thr Pro Gin Gly lie Met Gly He Phe Ala Lys Pro Asp His Val 

185 190 195 

Lys Met Thr Tyr Pro Lys Thr Gin Leu Gin His Ser Leu Pro Leu 

200 205 210 

Leu Leu He Cys Asp Asn Leu Arg Asp Pro Gly Asn Leu Gly Thr 

215 220 225 

He Leu Arg Ser Ala Ala Gly Ala Gly Gys Ser Lys Val Leu Leu 

23 0 235 240 

Thr Lys Gly Cys Val Asp Ala Trp Glu Pro Lys Val Leu Arg Ala 

245 250 255 

Gly Met Gly Ala His Phe Arg Met Pro He He Asn Asn Leu Glu 

260 265 270 

Trp Glu Thr Val Pro Asn Tyr Leu Pro Pro Asp Thr Arg Val Tyr 

275 280 285 

Val Ala Asp Asn Cys Gly Leu Tyr Ala Gin Ala Glu Met Ser Asn 
' 290 295 300 

Lys Ala Ser Asp His Gly Trp Val Cys Asp Gin Arg Val Met Lys 

305 310 315 

Phe His Lys Tyr Glu Glu Glu Glu Asp Val Glu Thr Gly Ala Ser 

320 325 330 

Gin Asp Trp Leu Pro His Val Glu Val Gin Ser Tyr Asp Ser Asp 

335 340 345 

Trp Thr Glu Ala Pro Ala Ala Val Val He Gly Gly Glu Thr Tyr 

350 355 ; 360 

Gly Val Ser Leu Glu Ser Leu Gin Leu Ala Glu Ser Thr Gly Gly 

365 370 375 

Lys Arg Leu Leu He Pro Val Val Pro Gly Val Asp Ser Leu Asn 

380 385 390 

Ser Ala Met Ala Ala Ser lie Leu Leu Phe Glu Gly Lys Arg Gin 

395 400 405 

Leu Arg Gly Arg Ala Glu Asp Leu Ser Arg Asp Arg Ser Tyr His 

410 415 420 



<210> 36 
<211> 354 
<212> PRT 

<213> Homo sapiens 



<220> 

-<221> mis cofeature 
<223> Incyte ID No: 37459 14CD1 



<400> 36 



Met 


Ala Pro 


Ala 


Lys 
5 


Ala 


Thr 


Asn 


1 

Ser 


Thr Ala 


Leu 


Trp 


Leu 


Ser 


Gin 








20 








Ala 


Ser Lys 


Ser 


Val 


Thr 


Ala 


His 






35 








Thr 


Pro Leu 


Leu 


Leu 


Glu 


Ala 


Ser 








50 








Asn Glu Lys 


Phe 


Trp Gin 


Phe 


Leu 



Val 


Val Arg 


Leu Leu 


Leu 


Gly 




10 






15 


Leu Gly Ser 


Gly Thr Val Ala 




25 






30 


Leu 


Ala Ala 


Lys Trp 


Pro 


Glu 




40 




45 


Glu 


Phe Met 


Ala Glu 


Glu 


Ser 




55 






60 


Glu 


Thr Val 


Gin Glu 


Leu 


Ala 
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65 




lie 


Tyr 


Lys Gin 


Thr Glu 


Ser Asp 










80 


Leu 


Lys 


Lys 


Ala 


Gly Gin 


Phe Leu 








95 




Leu 


Lys 


Phe Ala 


Phe Ser 


He Arg 








110 




Met 


Phe 


Gin 


Gin 


He Ala 


Ala Asp 










125 




As n 


Ala 


Phe 


Val 


Val lie 


His Lys 










140 




Glu 


lie 


Lys 


Lys 


Leu Leu 


Lys Lys 










155 




Tyr 


Leu 


Phe 


Lys 


Gly Asp 


His Lys 








170 




Leu 


Pro 


Val 


Val 


He Leu 


Tyr Ala 










185 




Ser 


Ala 


Phe 


His 


Lys Val 


Leu Ser 










200 




lie 


Leu 


Tyr 


Val 


Leu Arg 


His Tyr 










215 




Lys 


Met 


Tyr 


Leu 


Ser Gly 


Tyr Gly 










230 




Thr 


Glu 


Tyr 


Lys 


Ala Leu 


Asp Asp 










245 




Asn 


Thr 


Thr 


Val 


Glu Asp 


Glu Thr 










260 




Phe 


Leu 


Phe Gly 


Lys Leu 


Lys Glu 










275 




Asn 


Leu 


Thr 


Ala 


Phe Gin 


Lys Tyr 










290 




Met 


Met 


Pro 


Leu 


Lys Val 


Trp Glu 










305 




Ala 


Ala 


Ser 


Gin 


He Met 


Ser Ala 










320 




Leu 


Met 


Lys 


Asp 


lie Ser 


Gin Asn 






335 




Gin 


Met 


He Gly 


Asn Val 


Leu lie 










350 





<210> 37 

<211> 198 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc__f eature 

<223>^ ^^^^NojL 4000776CD1 



<400> 37- 










Met 


Ser 


Ser 


Lys 


Arg Ser 


His 


Tyr 


1 








5 






Val 


He 


Val 


Tyr 


Ala Glu 
20 


Lys 


His 


Thr 


Phe. 


Asp 


lie 


Ser Glu 


Ala 


Asn 








35 






Arg 


Asn 


Ser 


lie 


Phe Ser 


Cys 


Lys 








50 






Gly 


Pro 


Lys 


Lys 


Gly Arg 
65 


Tyr 


Pro 


Arg 


Phe 


Val 


Ser 


Glu Thr 


Arg 


Ala 








80 






Gin 


Ala 


Met 


Gin 


Leu Lys 
95 


Ala 


Gly 


lie 


Asp 


Glu 


Thr 


Lys Phe 
110 


Lys 


Ala 


Phe 


Met 


Arg 


Arg 


Ala Gly 


Leu 


Ser 





70 






75 


Tyr 


Ser 


Tyr 


Tyr Asn Leu 


He 


85 






90 


Asp 


Asn 


Leu 


His lie Asn 


Leu 


100 






inc. 

105 


Ala 


Tyr 


Ser 


Pro Ala lie 


Gin 




115 






120 


Glu 


Pro 


Pro 


Pro Asp Gly 


Cys 




.130 






13b 


Lys 


His 


Thr 


Cys Lys He 


Asn 


145 




Arg Thr Arg 


150 


Ala 


Ala 


Ser 


Pro 




160 






165 


Phe 


Pro 


Thr 


Asn Lys Glu 


Asn 




175 






180 


Glu 


Met 


Gly 


Thr Arg Thr 


Phe 




190 






195 


Glu. 


Lys 


Ala 


Gin Asn Glu 


Glu 




205 






210 


He 


Gin 


Lys 


Pro Ser Ser 


Arg 




220 






225 


Val 


Glu 


Leu 


Ala lie Lys 


Ser 




235 






240 


Thr 


Gin 


Val 


Lys Thr Val 


Thr 




250 






255 


Glu 


Thr 


Asn 


Glu Val Gin 


Gly 




265 






270 


lie 


Tyr 


Ser 


Asp Leu Arg 


Asp 




280 






285 


Leu 


He 


Glu 


Ser Asn Lys 


Gin 




O ft c 

295 








Leu 


Gin, 


Asp 


Leu Ser Phe 


Gin 




310 




315 


Pro 


Val 


Tyr 


Asp Ala He 


Lys 




325 






330 


Phe 


Pro 


lie 


Lys Ala Arg 


Val 




340 






345 


Gly 











Asp Ser 


Ala 


Leu 


Lys 


Arg 


Lys 


10 










15 


Gly Asn 


Arg 


Ala 


Ala 


Gly 


Arg 


25 










30 


lie Arg 


Arg 


Trp 


Arg 


Asn 


Asp 


40 










45 


Ala Thr 


Thr 


Lys 


Cys 


Phe 


Thr 


55 










60 


Gin Val 


Asp 


Glu 


Ala 


Val 


Leu 


70 








75 


Lys Gly 


Leu 


Pro 


He 


Thr 


Arg 


85 










90 


Glu Val 


Ala 


Lys 


Thr 


Leu 


Gly 


100 










105 


Thr Arg 


Gly 


Trp 


Cys 


Asp 


Arg 


.115 










120 


Leu Arg 


His 


Gin 


Thr 


Ser 


Phe 
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125 


Cys 


Pro Lys 


Leu 


Pro Thr Ala He Lys 




140 


His 


Ser Phe 


Lys 


Lys Cys Cys He Thr 








155 


Gly Arg Asp 


Val 


Leu Trp Lys Asn Ala 








170 


Leu 


Lys Ser 


Asp 


Ser Glu Glu Leu Asp 




185 


lie 


He Thr 







<210> 38 
<211> 296 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 40713 04CD1 

<400> 38 

Met Met Leu Pro Leu Gin Gly Ala Gin 

1 5 . 

Lys Ser Leu Arg Lys Ser Leu Pro Ala 
20 

Thr Val Phe His He Asn His Gly Asn 
35 

Val Val Asp Lys Trp Pro Asp Phe Asn 
50 

Gin Glu Gin Asp Met Thr Asp Asp Leu 
65 

Tyr Gin He Tyr Ser Lys Asp Pro Gin 
80 

Gly Ser Pro Glu Leu He Asn Trp Lys 
95 

Ser Ser Gin Pro Ser Leu Asn Glu Ala 
110 

He Lys Ser Phe Lys Val Lys Gin Thr 
125 

Ala Ala Glu Thr Ala Lys Glu Leu Thr 
140 

Lys He Leu Ser Pro Ser Gly Gly Lys 
155 

Glu Met Phe Lys Leu ser Ser Met Asp 
170 

Val Asn Lys Phe Trp His Phe Gly Gly 
.185 

Phe He Glu Arg Cys He Gin Thr Phe 

200- 

Gly Pro Glu Gly Thr Pro Val Cys Trp 
215 

Gly Glu Met Arg Met Ala Gly Thr Phe 
230 

Gly Leu Val Thr Tyr Val lie Tyr Ser 
245 

Lys Leu Gly Phe Pro Val Tyr Ser His 
260 

Ala Met Gin Lys Met Ser Tyr Thr Leu 
275 

Arg ser Trp Asn Gin Trp Asn Cys Val 
290 

<210> 39 

<211> 214 

<212> PRT 

<213> Homo sapiens 



PCT/US00/30485 



130 




135 


Gin 


Lys Thr Val Leu 


Glu 


145 


150 


Ser 


Thr Leu Asp Asn 


Thr 


160 




165 


Asp 


He Asn Asp Cys 


Gly 


175 




180 


Ser 


Glu Tyr Glu Val 


He 


190 


195 


Met 


Leu Gin Met Leu 


Glu 


10 




15 


Ser 


Leu Lys Val Tyr 


Gly 


25 




30 


Pro 


Phe Asn Leu Lys 


Ala ' 


40 




45 


Thr 


Val Val Val Cys 


Pro 


55 




60 


Asp 


His Tyr Thr Asn 


Thr 


70 




75 


Asn 


Cys Gin Glu Phe 


Leu 


85 




90 


Gin 


His Leu Gin He 


Gin 


100 




105 


He 


Gin Asn Leu Ala 


Ala 


115 




120 


Gin 


Arg lie Leu Tyr 


Met 


130 




135 


Pro 


Phe Leu Leu Lys 


Ser 


145 




150 


Pro 


Lys Ala He Asn 


Gin 


160 




165 


Val 


Thr His Ala His 


Leu 


175 




180 


Asn 


Glu Arg Ser Gin 


Arg 


190 




1 ft c 

195 


Pro 


. .... . _ 

Thr Cys Cys Leu 


Leu" 


205 




210 


Asp 


Leu Met Asp Gin 


Thr 


220 




one 

225 


Ala 


Glu Tyr Arg Leu 


MIS 


235 






LI-! a 
HIS 


Ala bin Jbys Lieu 




250 




255 


Val 


Asp Tyr Ser Asn 


Glu 


265 




270 


Gin 


His Val Pro He 


Pro 


280 




285 


Pro 


Leu 




295 
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<220> 

<221> misc_feature 
<223> Incyte ID No: 



<400> 39 



Met 


Ala 


Gly 


GlU 


Asn 


1 






5 


Arg 


Gly 


Ala 


Phe 


Ser 
20 


Leu 


Leu 


Leu 


Asp 


Ala 
35 


Val 


Glu 


lie 


Cys 


Leu 
50 


Ala 


Phe 


Leu 


Ala 


Ser 
65 


Thr 


Asp 


He 


Asn 


Pro 








80 


Arg 


Cys 


Asn 


Lys 


Val 
95 


Lys 


Giy 


Leu 


Leu 


Pro 
110 


Phe 


Asn 


Pro 


Pro 


Tyr 
125 


His 


Gly 


He 


GlU 


Ala 








140 


Val 


Met 


Asp 


Arg 


Phe 
155 


Arg 


Gly 


Leu 


Phe 


Tyr 








170 


GlU 


He 


Leu 


Lys 


He 








185 


Ala 


Leu 


Ser 


Arg 


Gin 
200 


Phe 


Thr 


Lys 


Ser 





4344970CD1 



Phe Ala Thr Pro Phe 
10 

Asp Val Tyr Glu Pro 
25 

Leu Glu Ala Ala Ala 
40 

Glu Val Gly Ser Gly 
55 

Met He Gly Pro Gin 
70 

Glu Ala Ala Ala Cys 
85 

His He Gin Pro Val 
100 

Arg Leu Thr Glu Lys 
115 

Val Val thr Pro Pro 
130 

Ala Trp Ala Gly Gly 
145 

Phe Pro Leu Val Pro 
160 

Leu Val Thr He Lys 
175 

Met Lys Thr Lys Gly 
190 

Ala Gly Gin Glu Thr 

. 205 



His Gly His Val Gly 
15 

Ala Glu Asp Thr Phe 
30 

Ala Glu Leu Ala Gly 
45 

Ser Gly Val Val Ser 
60 

Ala Leu Tyr Met Cys 
75 

Thr Leu Glu Thr Ala 
90 

He Thr Asp Leu Val 
105 

Val Asp Leu Leu Val 
120 

Gin Glu Val Gly Ser 
135 

Arg Asn Gly Arg Glu 
150 

Asp Leu Leu Ser Pro 
165 

Glu Asn Asn Pro Glu 
180 

Leu Gin Gly Thr Thr 
195 

Leu Ser Val Leu Lys 
210 



<210> 40 

<211> 322 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature } 

<223> Incyte ID No: 5392302CD1 

<400> 40 

Met Ala Ala Ser Gly Glu Pro Gin Arg Gin Trp Gin Glu Glu Val 

... 1 .. ... . 5 ___ _1_0 15 

Ala Ala Val Val Val Val Giy Ser Cys Met Thr Asp Leu Val Ser" 

20 25 30- 

Leu Thr Ser Arg Leu Pro Lys Thr Gly Glu Thr He His Gly His 

35 40 45 

Lys Phe Phe lie Gly Phe Gly Gly Lys Gly Ala Asn Gin Cys Val 

50 55 60 

Gin Ala Ala Arg Leu Gly Ala Met Thr Ser Met Val Cys Lys Val 

65 70 75* 

Gly Lys Asp Ser Phe Gly Asn Asp Tyr He Glu Asn Leu Lys Gin 

80 85 90 

Asn Asp He Ser Thr Glu Phe Thr Tyr Gin Thr Lys Asp Ala Ala 

95 100 105 

Thr Gly Thr Ala Ser He He Val Asn Asn Glu Gly Gin Asn He 

110 115 120 

lie Val He Val Ala Gly Ala Asn Leu Leu Leu Asn Thr Glu Asp 

125 130 135 

Leu Arg Ala Ala Ala Asn Val He Ser Arg Ala Lys Val Met Val 

140 145 150 

Cys Gin Leu Glu He Thr Pro Ala Thr Ser Leu Glu Ala Leu Thr 
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155 








1 fi^ 

lOD 


Met 


Ala 


Arg Arg 


Ser Gly 


Val 


Lys Tnr Leu Pile Asn rro Aia 


Pro 






170 






1 / D 


1 AO 


Ala 


lie 


Ala Asp 


Leu Asp 


Pro 


Gin Phe Tyr Thr Leu Ser Asp 


vai 






185 






1 Oft 


179 


Phe 


cys 


Cys Asn 


Glu 


Ser 


Glu 


Ala Glu lie Leu Tnr Giy Leu 


Thr 




200 






one 




Val 


Gly 


Ser Ala 


Ala 


Asp 


i> i _ • 

Ala 


Gly Giu Aia Aia Leu vai Leu 


Leu 






215 






x___U 


99^ 


Lys 


Arg 


Gly Cys 


Gin 


Val 


Val 


lie lie Tnr Leu Giy Aia giu 


tsiy 


230 










Cys 


Val 


Val Leu 


Ser 


Gin 


Thr 


Glu Fro giu Pro Lys nis lie 


Pro 






245 






"OCA 




Thr 


Glu 


Lys Val 


Lys 


Ala 


vai 


Asp inr inr *_»iy Aia biy nsp 


Ser 






260 








970 


Phe 


Val 


Gly Ala 


Leu 


Ala 


Phe 


iyr Lieu Aia iyx lyr irro 


f on 

1_ICU 






275 






280 


285 


Ser 


Leu 


Glu Asp 


Met 


Leu 


Asn 


Arg Ser Asn Phe lie Ala Ala 


Val 






290 






295 


300 


Ser 


Val 


Gin Ala 


Ala 


Gly 


Thr 


Gin Ser Ser Tyr Pro Tyr Lys 


Lys 








305 




310 


315 


Asp 


Leu 


Pro Leu 


Thr 


Leu 


Phe 










320 











<210> 41 

<211> 87 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5555235CD1 

<400> 41 

Met Ser Thr Ser Val Pro Gin Gly His Thr Trp Thr Gin Arg Val 

1 5 10 1 . 15 

Lys Lys Asp Asp Glu Glu Glu Asp Pro Leu Asp Gin Leu lie Ser 

20 25 30 

Arg Ser Gly Cys Ala Ala Ser His Phe Ala Val Gin Glu Cys Met 

35 40 45 

Ala Gin His Gin Asp Trp Arg Gin Cys Gin Pro Gin Val Gin Ala 

50 55 60 
Phe Lys Asp Cys Met Ser Glu Gin Gin Ala Arg Arg Gin Glu Glu 

65 70 75 

Leu Gin Arg Arg Glh Glu Gin Ala Gly Ala His His 

80 85 

<210> 42 ■ _ _____ 

<211> 378 

<212> PRT - - 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No : 5573296CD1 

<400> 42 

Met Asp Leu Ala Gly Leu Leu Lys Ser Gin Phe Leu Cys His Leu 

1 5 10 15 

Val Phe Cys Tyr Val Phe lie Ala Ser Gly Leu He He Asn Thr 

20 25 30 

He Gin Leu Phe Thr Leu Leu Leu Trp Pro He Asn Lys Gin Leu 

35 40 45 

Phe Arg Lys lie Asn cys Arg Leu Ser Tyr Cys He Ser Ser Gin 

50 55 60 

Leu Val Met Leu Leu Glu Trp Trp Ser Gly Thr Glu Cys Thr He 

65 70 75 
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Phe 


Thr 


Asp 


Pro 


Arg 


Ala 


Tyr 


Leu Lys 


Tyr 


Gly 


Lys 


Glu 


Asn 


Ala 








80 








85 










90 


lie 


Val 


Val 


Leu 


Asn 


His 


Lys 


Phe Glu 


He 


Asp 


Phe 


Leu 


Cys 


Gly 










95 






100 










105 


Trp 


Ser 


Leu 


Ser 


Glu 


Arg 


Phe 


Gly Leu 


Leu 


Gly 


Gly 


Ser 


Lys 


Val 








110 






115 










120 


Leu 


Ala 


Lys 


Lys 


Glu 


Leu 


Ala 


Tyr Val 


Pro 


He 


He 


Gly 


Trp 


Met 








125 








130 










135 


Trp 


Tyr 


Phe 


Thr 


Glu 


Met 


Val 


Phe Cys 


Ser 


Arg 


Lys 


Trp 


GlU 


Gin 






140 








145 










150 


Asp 


Arg 


Lys 


Thr 


Val. 


Ala 


Thr 


Ser Leu 


Gin 


His 


Leu 


Arg 


Asp 


Tyr 




155 








160 










165 


Pro 


Glu 


Lys 


Tyr 


Phe 


Phe 


Leu 


He His 


Cys 


Glu 


Gly 


Thr 


Arg 


Phe 








170 








175 










180 


Thr 


Glu 


Lys 


Lys 


His 


Glu 


He 


Ser Met 


Gin 


Val 


Ala 


Arg 


Ala 


Lys 








185 








190 










195 


Gly 


Leu 


Pro 


Arg 


Leu 


Lys 


His 


His Leu 


Leu 


Pro 


Arg 


Thr 


Lys 


Gly 








200 








205 










210 


Phe 


Ala 


lie 


Thr 


Val 


Arg 


Ser 


Leu Arg 


Asn 


Val 


Val 


Ser 


Ala 


Val 










215 






220 










225 


Tyr 


Asp 


Cys 


Thr 


Leu 


Asn 


Phe 


Arg Asn 


Asn 


Glu 


Asn 


Pro 


Thr 


Leu 




230 








235 










240 


Leu 


Gly 


Val 


Leu 


Asn 


Gly 


Lys 


Lys Tyr 


His 


Ala 


Asp 


Leu 


Tyr 


Val 








245 








250 










255 


Arg 


Arg 


He 


Pro 


Leu 


Glu 


Asp 


He Pro 


Glu 


Asp 


Asp 


Asp 


GlU 


Cys 








260 








265 










270 


Ser 


Ala 


Trp 


Leu 


His 


Lys 


Leu 


Tyr Gin 


Glu 


Lys 


Asp 


Ala 


Phe 


Gin 








275 






280 










285 


Glu 


Glu 


Tyr 


Tyr 


Arg 


Thr 


Gly 


Thr Phe 


Pro 


Glu 


Thr 


Pro 


Met 


Val 






290 






295 










300 


Pro 


Pro 


Arg 


Arg 


Pro 


Trp 


Thr 


Leu Val 


Asn 


Trp 


Leu 


Phe 


Trp 


Ala 










305 








310 










315 


Ser 


Leu 


Val 


Leu 


Tyr 


Pro 


Phe 


Phe Gin 


Phe 


Leu 


Val 


Ser 


Met 


He 










320 








325 










330 


Arg 


ser 


Gly 


Ser 


Ser 


Leu 


Thr 


Leu Ala 


Ser 


Phe 


He 


Leu 


Val 


Phe 






335 








340 










345 


Phe 


Val 


Ala 


Ser 


Val 


Gly 


Val 


Arg Trp 


Met 


He 


Gly 


Val 


Thr 


Glu 










350 






355 










360 


lie 


Asp 


Lys 


Gly 


Ser 


Ala 


Tyr 


Gly Asn 


Ser 


Asp 


Ser 


Lys 


Gin 


Lys 



365 370 375 

Leu Asn. Asp 



<210> 43 
<211> 1322 
<212> DNA 

<213> Homo sapiens 

<22~bV " " 

<221> misc_feature 

<223> Incyte ID No: 016233CB1 

<400> 43 

gcagcatcac gctgacctct gcctgggatg taaaccggac cagccgctgc gggcaaagga 60 

aggctcttgg ctccttcggg aaacccagcc ccgtcaccgg gctccgagcg gctcgcaggc 120 

gagcgacagc gacctcagcc ccggcagcgc ccagcggcgg ctgcggaaag, cggagggagt 180 

ccgacgcggg cgcgggcggg gagcgtgcgt ccgttcgcac aggcagcggg aggaggggcg 240 

gcgcgaacca tggccgggga cagcgagcag accctgcaga accaccagca gcccaacggc 300 

ggcgagccct tccttatagg cgtcagcggg ggaacagcta gcggcaagtc ttccgtgtgt 360 

gctaagatcg tgcagctcct ggggcagaat gaggtggact atcgccagaa gcaggtggtc 420 

atcctgagcc aggatagctt ctaccgtgtc cttacctcgg agcagaaggc caaagccctg 480 

aagggccagt tcaactttga ccacccggat gcctttgaca atgaactcat tctcaaaaca 540 

ctcaaagaaa tcactgaagg gaaaacagtc cagatccccg tgtatgactt tgtctcccat 600 

tcccggaagg aggagacagt tactgtctat cccgcagacg tggtgctctt tgaagggatc 660 

ctggccttct actcccagga ggtacgagac ctgttccaga tgaagctttt tgtggataca 720 

gatgcggaca cccggctctc acgcagagta ttaagggaca tcagcgagag aggcagggat 780 

cttgagcaga ttttatctca gtacattacg ttcgtcaagc ctgcctttga ggaattctgc 840 
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ttgccaacaa 
atcaacctca 
accaatggct 
agcaggccgc 
ggggtcagga 
aaaacaccat 
aaaatgaaac 
gatgcctaat 
aa 



agaagtatgc 
tcgtgcagca 
gtctcaacgg 
attgacccgt 
ggcactgctc 
ggagatgaaa 
agaacttgac 
tatgaatcca 



tgatgtgatc 
catccaggac 
ctacacccct 
ctccatcgga 
atctgtacat 
tgcctttgat 
cctgagctta 
acgtgtaacc 



atccctagag 
atcctgaatg 
tcacgcaaga 
ccccagcccc 
actgtttcct 
tttttttttc 
aataacaaaa 
agttataaat 



gtgcagataa 
gagggccctc 
ggcaggcatc 
tatctccaag 
atgacattac 
tttttgtact 
ctgtgccaac 
acatatatat 



tctggtggcc 
caaacggcag 
ggagtccagc 
agacagagga 
tgtatttaag 
ttggaacgac 
tactactggt 
ataaaaaaaa 



900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1322 



<210> 44 , 

<211> 1302 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 07833 6CB1 



<400> 44 

aagttttggt 

tgtggtatct 

atccagtggc 

gaaatcaggg 

gctctatcta 

ggtgatatcc 

gatgactgcg 

tatatgggct 

acctgctctt 

atcaccaagc 

tccctgggag 

acaattaacc 

gagcattttc 

ttccctagtg 

tttaaactca 

tgtggaattg 

acatatgatc 

ataggcattg 

ctgaaaaaca 

atgacaatta 

cttaacaggt 

aaaaataaaa 



gtttaaatgt 
acttagcaac 
tttctaaatc 
atccaatact 
gcccaaaaac 
aagaactgta 
atttgccctc 
atctggacta 
tcaatcctgg 
aattggagaa 
gaggggtggc 
ccctgtggca 
tgcaggaagc 
ttcacaacga 
atcaccatag 
tccctttgta 
cacaatataa 
attaacttct 
taaaggaaga 
tatccacaaa 
tgccaacctt 
tattcatcat 



tgtgttctat 
agaaataaag 
tgtaggcaag 
cttctcccat 
aaatgctata 
tgacaccacc 
tgctcaggac 
ccggaagaag 
tgtgattgtt 
atggatgcaa 
cacctcccca 
cataaggcac 
taaattactc 
cttatgggaa 
ctgatataac 
gccaactata 
aaaccaaaaa 
ttaagtacat 
gacccatcta 
tttttaaaac 
ttcttttata 
ttttatgaaa 



ctcttccact 
aaatt agate 
actcttaata 
agcaaaatca 
atgttttact 
ttggccctgg 
ataaacagac 
gecatcaagg 
gecaacatga 
aagaatgtgg 
atgctgattg 
ctgggctgga 
cactggaatg 
agctggtttg 
tctaccctta 
acattgttct 
ctactgtgtg 
gtgataacta 
gataacagca 
ttctacatgt 
aaactattac 
aaaaaaaaaa 



acaaacagga 
agtgccagtt 
aataaatagt 
agagctgttt 
tggtggtgtt 
gccacgcggc 
tcgtgggact 
accttggcat 
cagaatggaa 
aggaaaacct 
tgtttcatgg 
atecagatge 
gaagacataa 
ttcctgaccc 
aaatattccc 
ttatgaatat 
caaattatac 
tggaaatcaa 
atcaacctgc 
atttttcaca 
atttaaaata 
aa 



tacgeattgg 
tgaaetcatc 
caagccactt 
aagctctaag 
tctaaattca 
ggctttctca 
tcagaacaca 
cagccccagc 
gcaccagcgc 
ctatagcagc 
gaaatattcc 
cagatattcg 
accttgggac 
tgcagggata 
tgtatagaaa 
tacctttgat 
cttggaccat 
gattatgtga 
ttaattctga 
tgaagatctc 
tggacgtctg 



<210> 45 

<211> 1771 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> mis cofeature 
<223> Incyte ID No: 



130117CB1 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1302 



<400> 45 

tgacttacag 

tctggtttct 

gatctttctt 

gtccacgcat 

tetgetaegt 

tcctctggcc 

tctcaagcca 

cggacccgcg 

agtttgaaat 

gctccaaggt 

acttcaccga 

ecagtttgea 

geacaeggtt 

tgcctcgcct 



ctcttataaa 
aagtccatgt 
ccttctggaa 
catggacctc 
etttattgee 
cattaacaag 
gctggtgatg 
cgcctacctc 
tgactttctg 
cctggccaag 
gatggtcttc 
gcacctccgg 
caeggagaag 
caagcatcac 



ctagtggcaa 
gecaaagget 
atctttgact 
gcgggactgc 
tcagggctaa 
cagctcttcc 
ctgctggagt 
aagtatggga 
tgtggctgga 
aaagagctgg 
tgttcgcgca 
gactaccccg 
aag cat gaga 
ctgttgccac 



tttctgaacc 
gecaggaagg 
gtgggtagtt 
tgaagtctca 
tcatcaacac 
ggaagatcaa 
ggtggtcggg 
aggaaaatgc 
gcctgtccga 
cctatgtccc 
agtgggagca 
agaagtattt 
teagcatgea 
gaaccaaggg 



cagccggctc 
agacgccttc 
atttatttct 
gttcctgtgc 
cattcagctc 
ctgcagactg 
caeggaatge 
catcgtggtt 
acgctttggg 
aattategge 
ggatcgcaag 
tttcctgatt 
ggtggcccgg 
cttcgccatc 



catctcagct 60 
ctgagtcctg 120 
gaataagagc 180 
cacctggtct 240 
ttcactctcc 300 
tectattgea 360 
accatcttca 420 
ctcaaccaca 480 
ctgttagggg 540 
tggatgtggt 600 
aeggttgeca 660 
cactgtgagg 720 
gecaagggge 780 
accgtgagga 840 
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gcttgagaaa 
atccaacact 
ggatcccact 
tctaccagga 
cgcccatggt 
tggtgctcta 
cgctggccag 
gtgtgacgga 
atgactgact 
catatcctcc 
gaagtcacga 
tttgtaatct 
tgtgtgagaa 
ttaggctgca 
tggtgctgag 
agatgactaa 



tgtagtttca 
gctgggagtc 
ggaagacatc 
gaaggatgcc 
gcccccccgg 
ccctttcttc 
cttcatcctc 
aattgacaag 
cagggaggtg 
ttagtgggac 
cctctccagc 
ttttttcccc 
tggctgtgtg 
ggggagggca 
ttttctgtaa 
attatgcctc 



gctgtatatg 
ctaaacggaa 
cctgaagacg 
tttcaggagg 
cggccctgga 
cagttcctgg 
gtcttctttg 
ggctctgcct 
tcaccatccg 
acggtgacaa 
cagggagtct 
atgtgcttta 
gtgagtgtga 
gggctgggga 
cccttggttg 
caagaaaaaa 



actgtacact 
agaaatacca 
atgacgagtg 
agtactacag 
ccctcgtgaa 
tcagcatgat 
tggcctctgt 
acggcaactc 
aagggaacct 
aggctgggtg 
ggtctcaagg 
gtgggctttg 
actttgttct 
ccgaagggga 
ccagagataa 



caatttcaga 
tgcagatttg 
ctcggcctgg 
gacgggcacc 
ctggctgttt 
caggagcggg 
gggagttcga 
tgacagcaag 
tggggaactg 
agcccctgct 
ccggatgggg 
gttttctttt 
gtgatcatag 
caagttcccc 
agtgaaaagt 



aataatgaaa 
tatgttagga 
ctgcacaagc 
ttcccagaga 
tgggcctcgc 
tcttccctga 
tggatgattg 
cagaaactga 
gtggcctctg 
gggcacggcg 
aggaagatgt 
tgtgcgagtg 
aaagggtatt 
tttcatcctt 
gctttaggtg 



<210> 46 

<211> 1755 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 267495CB1 



<400> 46 
caatccgaag 
gtctgaaata 
tttcacaggg 
gtttaagcca 
acggtggctc 
ggcgggtttc 
gccccctccc 
aggccggcgg 
gaaaggtggt 
ctctcttctc 
ggacatcaca 
ctacacctca 
ggttgttgct 
tatcagtgat 
tcttgttcta 
aatgaatgct 
tgttttttgg 
acaaaatggc 
aaatctaatc 
agcccagcag 
_taccgaggag. 
agcaggcact 
catcacagga 
agaaaatagc 
ctgatgtaaa 
tcatcttgaa 
ggcagtttcc 
aagtgcctgg 
gccttccacc 
gaaaattaac 



gacaggaagg 
attatatcct 
aaccatgttt 
tgggttagag 
cacggactcc 
ttcagtggag 
gctagtcgta 
cgtgggcctt 
agttaagaat 
ctcaagtttg 
gtcaagggcc 
gtcctaactg 
gatagagaag 
gttgcattcc 
agagcaacaa 
gtaggctgct 
acagacactt 
atgcacacat 
aagggaagga 
cttctggaga 
acactttgtg 
ttaaggcaaa 
ggcagcatac 
tcagaatctc 
tttcagccat 
ttttaatcct 
ctcatgctat 
catttcccct 
gtgattgtta 
cctct 



ccatcttaga 
gagagctctc 
ttcaaacaaa 
ggctcgtgag 
tctcggccga 
ccgggctctg 
aagtgcagta 
ttctctgcac 
ttgtaaaggc 
aaatgcttta 
tggaagttgt 
tagggaagga 
aagtggaaca 
ttgtggttgg 
agctgggaat 
gtggtttaca 
ggagaccaga 
tatgtttact 
agatctatga 
ttgttcaaaa 
ttggcttagc 
tgtgcactgt 
atccaatgga 
aaagcatcaa 
atatggattg 
cataattccc 
tctcatgata 
actggctctc 
agtttcctga 



ttgataagcc 
gtagtgtgaa 
ggcataagaa 
cacgcccgat 
ctgacctttc 
gtctccgcag 
aaggcaccag 
ggagccggcg 
cagagaacta 
tctcatcggg 
tagacgctgc 
agccttggaa 
agaagcagat 
tgatccattt 
tccttataga 
gttatataag 
aagcttcttt 
agacatcaaa 
acctccacgg 
tcaaagaata 
cagggttgga 
ggacttggga 
gatggagatg 
tggactttga 
atatggtttg 
aggtgttgtg 
gtgagctttc 
attctcactc 
ggccttccca 



atcttatgtt 
gctttcttct 
cccaggctta 
cctttccctg 
gccgccctgc 
cccagtagcc 
cattttgcgg 
cttttgcagt 
cctacgattc 
ttgggcctgg 
agtcgagtgt 
gagttttatg 
aatattttaa 
ggggccacaa 
gttattcaca 
tttggagaga 
gacaaagtga 
gtaaaggagc 
tatatgagtg 
cgaggagaag 
gccgacgacc 
gaaccattgc 
ctaagtctgt 
acatagatat 
gatgtatccc 
gtaggtaatt 
atgagatctg 
ttgccgccct 
gccatgtgga 



cccagtatct 
acatgtaaaa 
ggaagcgggt 
aggctatgaa 
cccagcagcc 
cgctagcccg 
caccgtagtg 
tgcttctgcg 
tctcagcggt 
gagatgccaa 
atctggaagc 
gaagaaaatt 
aggatgctga 
cacacagtga 
atgcctccat 
cagtttctat 
agaagaacag 
agtctttgga 
taaaccaagc 
aaccagcagt 
agaaaattgc 
attccttgat 
tttccatacc 
ttaccattgt 
cacccaagtc 
gaatcatggg 
atggttttat 
gtgaagaggt 
actgtgagtc 



<210> 47 

<211> 1811 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1771 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320" 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1755 



410533CB1 
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<400> 47 

gcggccgcag 

gcgccgcttg 

cgcccgtgcc 

ttggcttgcg 

ctcccaggcc 

cggcgagcac 

cgagaattgg 

cactaggagt 

actgggccaa 

ttatcagtat 

acatgatcat 

ctcttccaac 

taaaaccaac 

ctttggcagc 

ttacagcttt 

ttcaggtgat 

tcaatgggaa 

aaaaaggact 

aggttgatca 

atttagaatt 

cttttttgta 

ttaagtgaaa 

tcctacacaa 

ttcatgtaat 

taaagttata 

tatgaagttc 

gataacaggt 

aaggcagtga 

tattggaacc 

tttttaaact 

aaaaaaaaaa 



ggccatgcta 
ggctccggga 
ctgctgcttg 
gctgcccggg 
agcggccgga 
ccccagcctg 
cttggcccca 
,ttttgcttta 
tcaaagatca 
cttatatgtt 
ttcgagagat 
accacgaaca 
attcatcagc 
tccagttttc 
caccacagct 
aaaagactga 
cagggcccat 
tgtcagaatc 
tgggaatatg 
tttaatctta 
tgttttttaa 
tttggattgc 
ctgggcaata 
taatacctga 
aaagagatta 
attttcggta 
aacaaagata 
gttacgttcc 
tggattcata 
ttggagactc 
a 



gccttgcgcg 
acgcggccga 
ggctgcctgg 
atcggccagc 
gcgggcgccg 
tatgaaaacc 
gttctgggct 
gctggactaa 
gctttgggaa 
agcttgacct 
gtaatgttga 
cttgccaagt 
aaggtgaata 
aactatgctg 
gcatcagctt 
tgaaagtcat 
ggaaatgtac 
aactgtgtca 
cagaatttcc 
ggtttcttga 
aaacatagtc 
tcacaaagca 
catttttgtt 
tcatttggga 
aaaatgcggt 
gtcttccaac 
gtcgagatag 
tgccttccac 
ttttatgtaa 
tttcttttgt 



tggcgcgcgg 
gtaagcgacg 
ccgaacgctg 
ggaaccactg 
ctgccgaagc 
catggacaat 
atttgattat 
cagatttgtt 
gtgctcttga 
atgcagatct 
ttgctgctgt 
atttcaatcc 
cagcagtcca 
acagcattta 
atagttacta 
ccctcactgt 
aggagtttcc 
tcaaaattta 
aatgtatttt 
ttaatttata 
cagagcatgg 
ctaggaaatg 
tgatttttag 
taatgaaagt 
aactttttaa 
ctctcaggtg 
gagaacgtgt 
tgtgtttctg 
ataatatcaa 
taagcagtta 



ctcgtggggg 
cgcctgctgg 
gaggctgcgt 
ttcgggcgcg 
cccgggcggc 
cccgaatatg 
tgaagaagat 
ggatggattt 
tccacttgct 
tattccagtt 
tttttatgtc 
ttgctatgcc 
gttaatcttg 
tcttcagata 
tcattatggc 
tagtaaggaa 
ctattttggt 
agtaatgtgc 
taaatacaaa 
agagatcaat 
gcagaattga 
tcatggggtt 
gtctgtgtat 
gaagttagtt 
gataataatc 
cctaataatt 
ctattagtct 
acatagcaat 
gctgtatatt 
aaggaataaa 



gccctgcgcg 
gccctgctgc 
ccggccgctc 
gggaaggcgg 
cagtggggcc 
ttgtcaatga 
tttaatattg 
attgctcgaa 
gataaaatac 
ccacttactt 
agataccgaa 
actgctaggt 
gtggcagctt 
ctatggtgtt 
cggaagactg 
gcagtataca 
gttcagcttg 
attgaaaata 
taaaattgta 
tattgtcagt 
cacctctctt 
caaatatata 
acattaacag 
gtagatgaag 
atacagaagg 
tatgtttgag 
ttgcatctaa 
gtttgtttga 
tttcaaaggt 
agagctggaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1 **» £. A 

1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1811 



<210> 48 
<211> 1003 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<223> Incyte ID No: 



852708CB1 



<400> 48 

tcccacgcgt 

ggctcctgga 

agaaacgcga 

ctaagagaac 

ctgctgagtg 

ccatcgcgga 

ttcttatect 

ggtacattcc 

ggcaggatga 

cagaccagcc 

cggataaagt 

actatgcgcg 

ggcagctggt 

aggatgtggt 

tgctggtgag 

ggctcggctg 

cctaatgtga 



ccgcggacgc 
aaggaccgtc 
ttgagaaatg 
tatggcaaca 
cagtgaggcg 
ctacgctgcc 
gagacaaacc 
ggcaaaccat 
agagtacttc 
acgaacaact 
catcctggac 
gcctagagcg 
cctgcagaac 
gctgcccgag 
ggcgggcgtg 
ggccaacctc 
ggtctaatta 



gtgggcggac 
cacccctccg 
gaaaagaaaa 
tcaggtgact 
ggtctcctgc 
"accgatgaga 
actgcagatt 
gtggggaagc 
ggcagctatg 
aaataccaca 
gtgggctgtg 
gtgtacgcgg 
ggctttgctg 
aaggtggacg 
cgggcagctg 
aggatctcaa 
atttcttgtg 



gcgtgggcgg 
cgctggcggt 
tgaaataaat 
gtcccagaag 
aggagggagt 
cccagctcag 
ggtggtgggg 
acgtggatga 
gaactctgaa 
gtgtcatcct 
ggactgggat 
tggaggccag 
acatcatcac 
tgctggtgtc 
ggggccggag 
gggtcgtgcg 
tggaaaaaaa 



acgcgtgggg 
gtggacgcgg 
cagcagttat 
tgaatcgcag 
acagccagag 
ttttttgaga 
tgagcgtgcg 
gtacgacccc 
actccacttg 
gcagaataaa 
catcagtctc 
tgagatggca 
cgtgtaccag 
tgagtggatg 
ctggggggct 
tgattcattt 
aaa 



cggtggctca 
aactcagcgg 
gaggcagagc 
ggagaagagc 
gagtttgtgg- 
ggagaaaaaa 
ggctgctgtg 
gaggacacgt 
gagatgttgg 
gaatccctga 
ttctgtgcac 
cagcacacgg 
cagaaggtgg 
gggacctgcc 
tctgagcacg 
tgatgttttc 



60 
120 
180 
240 
300 
3 60* 
420 _ 
480 
540 
600 
660 
.720 
780 
840 
900 
960 
1003 



<210> 49 
<211> 1687 
<212> DNA 

<213> Homo sapiens 



<220> 



36/55 



WO 01/32888 



PCTYUS00/30485 



<221> misc_f eature 

<223> Ihcyte ID No: 972944CB1 

<400> 49 

gtcgccacgg taacgcgcgc gggcaggtgt ccgaccatga gcgtccgggt cgcacgggta 60 
gcgtgggtca ggggcttggg cgccagctac cgccgcggcg cctcgagctt cccggtgcct 120 
ccgccgggcg cccagggtgt agcggagctg ctgcgagatg cgaccggggc ggaggaggag 180 
gcgccctggg cggcgacgga gcggcgaatg ccgggccagt gctccgtgct gctcttcccg 240 
ggccagggca gccaggtggt gggcatgggc cgcggtctgc tcaactaccc gcgcgtccgc 300 
gaactctacg ccgccgcccg ccgcgtgctg ggctacgacc tgctggaact gagcctgcac 360 
gggccgcagg agaccctgga ccgcaccgtg cactgtcagc ccgcgatctt cgtggcatcg 420 
ctggccgctg tcgagaaact acatcacctg cagccctcgg tgattgagaa ctgtgttgct 480 
gctgctggat tcagtgtggg agagtttgca gccctagtgt ttgccggagc catggaattt 540 
gctgaaggtt tgtatgcagt gaaaatccga gctgaggcca tgcaggaagc ttcagaagct 600 
gtccccagtg ggatgctgtc tgtcctcggc cagcctcagt ccaagttcaa cttcgcctgt 660 
ttggaagccc gggaacactg caagtcttta ggcatagaga accccgtatg tgaagtgtcc 720 
aactacctct ttccagattg cagggtgatt tcaggacacc aagaggctct acggtttctc 780 
cagaagaatt cctctaagtt tcatttcaga cgcaccagga tgttgccggt tagtggcgca 840 
ttccacaccc gcctcatgga gccagccgtg gagcccctga cgcaagcttt aaaggcagtc 900 
gacattaaga agcctctggt ttctgtctac tccaacgtcc acgcgcatag atacaggcat 960 
cccgggcaca tccacaagct gctggcccag cagctggtct ccccagtgaa gtgggagcag 1020 
acgatgcatg ccatatacga aaggaaaaag ggcagggggt tcccccaaac tttegaagta 1080 
ggccctggca ggcagctggg agccatcctg aagagctgta acatgcaggc ctggaagtcc 1140 
tacagcgccg tggatgtgct gcagaccctc gaacatgtgg acctggaccc tcaggagccc 1200 
ccgagatgac tgcagggggc tcaaatgcga tgaccccctc tgtcctcctg aggagaggct 1260 
gtaggctgtg cctgtcgccc cctaccttcc taatggctcc tcctctgagg agtgaaaggg 1320 
atttgtttgc aacgtgcttt gaaggccaca taaaaagccc . taaaaatgag tatttcttta 1380 
catgacccag tccatttctc ccccttggaa aaacgtttgg acgttgggaa gaatgatgcc 1440 
acagggctgg tgtgtgggga agctggaatc ctggccccgc ctctgccagc cctgcaccat 1500 
aggcaggtgt gccatctcag cgggaagggg aggactggct gctgcagcct gtcctgctct 1560 
gtgagtgctg agaacatgct ggtgtggcct caatttcagc cagtgccagg ggagcccctc 1620 
caccaccccc agcctgcccc ccagggctcc tttgtcacaa atctgaaagg ttttagaaaa 1680 
aaaaaaa 1687 

<210> 50 
<211> 1239 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 99773 0CB1 

<400> 50 

ctcacgaggc cgtgggtacg accggaagcc gcagcatgct gatcccattt tcaatgaaga 60 
attgcttcca gttactttgt aactgccagg tcccagcagc tggctttaaa aaaacagtaa 120 
aaaatgggct cattttacag tcaatttcca atgatgtcta tcaaaatctg gctgtggaag 180 
actggatcca. tgaccatatg .aatctagaag gcaaaccaat tctattcttt tggcagaatt 240 
ctccctctgt tgtaattggt aggcatcaaa atccttggca ggaatgtaac ctgaat'ctaa 300 
tgagagaaga aggtataaaa ctggctcgga gaagaagtgg aggaggaaca gtctaccatg 360 
atatgggtaa tatcaatttg actttcttta caaccaaaaa aaagtatgat agaatggaaa 420 
atctgaaatt aattgtgaga gctctgaatg ctgtccaacc ccagctggat gtgcaggcta 480 
ccaaaagatt tgacctttta cttgatggac agtttaaaat ctcaggaaca gcttctaaga 540 
tcggccggac tactgcctat caccattgca ctttattatg tagtactgat gggacgttcc 600 
tgtcttcttt gctaaagagc ccttaccaag ggatcaggag caatgccact gctagcatac 660. 
cttccttagt gaaaaatctt ttggaaaagg atcccactct gacctgtgaa gtactaatga 720 
atgctgttgc tacagagtat gctgcctatc atcaaattga taatcacatt cacctaataa 780 
acccaacgga tgagacactg tttcctggaa taaatagcaa agccaaagaa ctgcaaactt 840 
gggagtggat atatggcaaa actccaaagt ttagtataaa tacttccttt catgtgttat 900 
atgaacagtc acacttggaa attaaagtat tcatagacat aaagaatgga agaattgaaa 960 
tttgtaatat tgaagcacct gatcattggt tgccattgga aatacgtgac aaattaaatt 1020 
caagtcttat tggcagtaag ttttgeccaa ctgaaactac catgctaaca aatatattac 1080 
ttagaacatg tccacaagac cacaaactaa acagtaaatg gaatattctc tgtgaaaaaa 1140 
ttaagggaat aatgtgattc caagtaaatg tcttaataca gtttcaatta gaaaataaaa 1200 
tgtctcatac ttgcacattg tatgtcaaaa aaaaaaaaa 1239 

<210> 51 
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840 
900 



<211> 1594 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 12859 44CB1 

<400> 51 

gtcggagccc taaccagggg tatctctgag cctggtggga tccccggagc gtcacatcac 60 
tttccgatca cttcaaagtg gttaaaaact aatatttata tgacagaaga aaaagatgtc 120 
attccgtaaa gtaaacatca tcatcttggt cctggctgtt gctctcttct tactggtttt 180 
gcaccataac ttcctcagct tgagcagttt gttaaggaat gaggttacag attcaggaat 240 
tgtagggcct caacctatag actttgtccc aaatgctctc cgacatgcag tagatgggag 3 00 
acaagaggag attcctgtgg tcatcgctgc atctgaagac aggcttgggg gggccattgc 360 
agctataaac agcattcagc acaacactcg ctccaatgtg attttctaca ttgttactct 420 
caacaataca gcagaccatc tccggtcctg gctcaacagt gattccctga aaagcatcag 480 
atacaaaatt gtcaattttg accctaaact tttggaagga aaagtaaagg aggatcctga 540 
ccagggggaa tccatgaaac ctttaacctt tgcaaggttc tacttgccaa ttctggttcc 600 
cagcgcaaag aaggccatat acatggatga tgatgtaatt gtgcaaggtg atattcttgc 660 
cctttacaat acagcactga agccaggaca tgcagctgca ttttcagaag attgtgattc 720 
agcctctact aaagttgtca tccgtggage aggaaaccag tacaattaca ttggctatct 
tgactataaa aaggaaagaa ttcgtaagct ttccatgaaa gccagcactt gctcatttaa 
tcctggagtt tttgttgcaa acctgacgga atggaaacga cagaatataa ctaaccaact 
ggaaaaatgg atgaaactca atgtagaaga gggactgtat agcagaaccc tggctggtag 960 
catcacaaca cctcctctgc ttatcgtatt ttatcaacag cactctacca tcgatcctat 1020 
gtggaatgtc cgccaccttg gttccagtgc tggaaaacga tattcacctc agtttgtaaa 1080 
ggctgccaag ttactccatt ggaatggaca tttgaagcca tggggaagga ctgcttcata 1140 
tactgatgtt tgggaaaaat ggtatattcc agacccaaca ggcaaattca acctaatccg 1200 
aagatatacc gagatctcaa acataaagtg aaacagaatt tgaactgtaa gcaagcattt 1260 
ctcaggaagt cctggaagat agcatgcgtg ggaagtaaca gttgctaggc ttcaatgcct 1320 
atcggtagca agccatggaa aaagatgtgt cagctaggta aagatgacaa actgccctgt 1380 
ctggcagtca gcttcccaga cagactatag actataaata tgtctccatc tgccttacca 1440 
agtgttttct tactacaatg ctgaatgact ggaaagaaga actgatatgg ctagttcagc 1500 
tagctggtac agataattca aaactgctgt tggttttaat tttgtaacct gtggcctgat 1560 
ctgtaaataa aacttacatt tttcaaaaaa aaaa 1594 

<210> 52 
<211> 700 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> incyte ID No: 12932 07CB1 

<400> 52 

cccggcctgc ggtgggcagc agctcaggtt ctccaaatca ttgcgtagtt ccgaataccc 60 _ 

tcggccacac ctggccttct ccatgctcgg aataacttcc tgcagcgacc aacaggctaa 120 

agagggggaa gggatccagc accggctcct cctccggcaa ccacggtggg agcggcggag 180_ 

gaaatggaca taaacccggg tgtgaaaagc cagggaatga agcccgcggg agcggggaat 240 

ctgggattca gaactctgag acgtctcctg ggatgtttaa ctttgacact ttctggaaga 3 00 

attttaaatc caagctgggt ttcatcaact gggatgccat aaacaagaac caggtcccgc 3 60 

cccccagcac ccgagccctc ctctacttca gccgactctg ggaggatttc aaacagaaca 420 

ctcctttcct caactggaaa gcaattattg agggtgcgga cgcgtcatca ctgcagaaac 480 

gtgcaggcag agccgatcag aactacaatt acaaccagca tgcgtatccc actgcctatg 540 

gtgggaagta ctcagtcaag acccctgcaa aggggggagt ctcaccttct tcctcggctt 600 

cccgggtgca acctggcctg ctgcagtggg tgaagttttg gtaggtgagt gtcagagtga 660 

gccgacccag gccacatcct ggcagtggag gcacagtcac 700 

<210> 53 
<211> 536 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
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<223> Incyte ID No: 1308125CB1 
<400> 53 

gttttaggcg agttggagat gtctgtgaga ggtctgaaca gtgatgtcca gttcaaggat 60 

ggagggcaaa gccaagtaca tcctccctac cgaaacaata tatgttgggg. aaatgaagga 120 

tggcatgttt cacggcgagg gaaccctgta cttccccagc ggaagccaat acgacgccat 180 

ttgggaaaac ggattggcca taaaggtatg gctcaactca ccaatatgga cccacctaga 240 * 

aaaatcccca agggctatta cgattgtgga gacggcttct ataacccagt cacgagggta 300 

gtcaaggact ataggaaccg ctttctaaga aacgcagatg atgacgagca tgagtggatc 3 60 

acccgtacct gtcgaaaggg ctaggatgag atcgtgggtc acaggcccga gccgtgaact 420 

ctgtggctgc ctccaccaga ggtttccatc tgccctacta gcattggctg ccctggggga 480 

cgggctgtag ttctagaacc tgattttaac tcaggaataa agactttctg cggtca 536 

<210> 54 

<211> 1130 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1439670CB1 

<400> 54 

agaggcgagg tggaggcgtt ggcgctgcca cgtctgggcc gcggttccca actgtggcgc 60 

gggcggtgga ggaggaggtg gggctggcgc tgaagccgga tccggatccg gtgctgtgca 120 

cactggtggg ggagagtccg acgcgcctgg ctaggagcgc cgaccgcagg gcctctacgg 180 

accttactag aaaaatgaaa cctgatgaaa ctcctatgtt tgacccaagt ctactcaaag 240 

aagtggactg gagtcagaat acagctacat tttctccagc catttcccca acacatcctg 300 

gagaaggctt ggttttgagg cctctttgta ctgctgactt aaatagaggt ttttttaagg 360 

tattgggtca gctaacagag actggagttg tcagccctga acaatttatg aaatcttttg 420 

agcatatgaa gaaatctggg gattattatg ttacagttgt agaagatgtg actctaggac 480 

agattgttgc tacggcaact ctgattatag aacataaatt catccattcc tgtgctaaga 540 

gaggaagagt agaagatgtt gttgttagtg atgaatgcag aggaaagcag cttggcaaat 600 

tgttattatc aacccttact ttgctaagca agaaactgaa ctgttacaag attacccttg 660 

aatgtctacc acaaaatgtt ggtttctata aaaagtttgg atatactgta tctgaagaaa 720 

actacatgtg tcggaggttt ctaaagtaaa aatcttgtaa gaaaattgtc aaaggggcta 780 

atgctacaag gctacactct tcctagagtt gaaatatttt gttgctgcag ccgagtgacc 840 

tccataaata ctggactgaa aaaacattgt aatactacaa gtataatgac atttagaaga 900 

ttactttggg ctggtgggac atgctgtgaa tttagattac aaatgaatat tataacaggg 960 

gatgattttt aaccaaacgg aatatatttt taacttggat cttttcttgc attgtatttt 1020 

ttctaaaagg tttggcatcc ctatcttggg aaggtcagga ggtatggggt aaataaaggg 1080 

agttaataat ggtctggcta atccggtggt ttggcctcat tttataaaaa 1130 

<210> 55 

<211> 930 

<212> DNA 

<213> Homo sapiens 

<220> " 

<221> mis cofeature - - - • 

<223> Incyte ID No: 1444281CB1 

<400> 55 

ctccgagagc gggccgggct cagttcagct gctgtccaga cccggatcgg caacagtgcc 60 
gcctccagac gttctcctgc cgctcgcccg cc cgtc c cag cgcccc cage cctcccgcga 120 
gggcgccccg ggaeggaagg atccaccagt ctgtcggcgc ccgccgttct cgtggtcgcc 180 
gtcgccgtcg tcgtggtggt agtctccgcc gtcgcctggg ccatggccaa ttacatccac 240 
gtccctcccg gctccccgga ggtgcccaag ctgaaegtea ccgttcagga tcaggaggag 300 
catcgctgcc gggagggggc cctgagcctc ctgcaacacc tgcggcctca ctgggacccc 360 
caggaggtga ccctgcagct cttcacagat ggaatcacaa ataaacttat tggctgttac 420 
gtgggaaaca ccatggagga tgtagtcctg gtgagaattt atggcaataa gactgagtta 480 
ttagtcgatc gagatgagga agtaaagagt tttcgagtgt tgeaggctea tgggtgtgca 540 
ccacaactct actgtacctt caataatgga cfcatgctatg aatttataca aggagaagca 600 
ctggatccaa agcatgtctg caacccagcc attttcagtt tatcategtt gaetctttge 660 
aaaggaaaaa ctacaagatg ttttggatta accggctgca gagggtcaag gettctgett 720 
agttttttct agttagtgtt tgagttgaag tgttatacat ttcttactgc egattgetta 780 
ttacttaatt gttcatgttt tcagagttct tgcctattaa aattttatta atttgggtaa 840 
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caattctttg ttctttacgt tgttcattat ttaatttaat atattatgtt ttctataaag 900 
aataatgtca tatatgcttg atcatttttg 930 

<210> 56 

<211> 1687 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1450140CB1 

<400> 56 

gtccgcgggg agctgcccct gtgcccgtgg ctctctgcgc atgcccggac cccggcggct 60 
ccggagtccg ggaggctctg gctgctccag tcgcgggtcc gggccccgcc tctccactcg 120 
gcggtggcgt ccgtgttgt-t attgtggtgg tagcggcgtc tggctgctgc ggaccccgcc 180 
gagtcctagc gcctggcctg cgcgccgctg cccgcgccac aggattaatt tatttttgga 240 
aatcaagtgc aatattggaa gccatttcta ctaattttat ttcactttta aatttatgat 300 
ttgatttgaa tactatcatg ggaggagctg tgagtgctgg ggaagataat gatgacttaa 3 60 
ttgataattt aaaagaagct cagtatattc gtactgaaag agtggagcaa gccttcagag 420 
cgattgatcg tggagattac tatttggaag gctacagaga caatgcttac aaagacttag 480 
cctggaagca tggaaacatc cacttgtcag caccttgcat ttattctgaa gttatggaag 540 
cattgaaact tcaaccagga ttgtcttttc ttaacctggg aagtggaacc ggatatttaa 600 
gtacaatggt gggcttaatt ttaggtcctt ttggaataaa tcatgggatt gagcttcatt 660 
cagatgtggt ggaatatgcc aaggaaaaac tggagagctt catcaaaaat agtgatagct 720 
ttgataaatt tgagttctgt gaacctgcat ttgttgttgg taattgcctc cagatagctt 780 
ctgacagtca tcagtatgat cgaatttatt gtggagctgg agtacagaaa gaccatgaaa 840 
actacatgaa aatattacta aaagttggag gcatattagt catgcctata gaggatcagt 900 
taacacagat tatgcgaact ggacagaaca cttgggaaag taaaaatatc cttgctgttt 960 
catttgctcc acttgtgcaa ccaagtaaga atgataatgg caaaccagat tctgtgggac 1020 
tccctccctg tgctgtcagg aatctacagg acttggctcg tatttacatt cgacgcacac 1080 
ttagaaattt cataaatgat gagatgcagg ccaaggggat tcctcaaagg gctccaccca 1140 
aaaggaaaag aaagagagtt aaacagagaa ttaacactta cgtatttgtg ggtaatcagc 1200 
ttattcctca gcctctagac agtgaagagg atgaaaaaat ggaagaggat atcaaagaag 1260 
aggaggaaaa agatcacaat gaagcaatga agccagagga gccacctcaa aatttactga 1320 
gagaaaaaat catgaagctg cccctccctg aatctttaaa agcttacttg acatatttta 1380 
gagacaaata acttagatca agaagaaaaa tgcctactga taattccttt agtcttgaaa 1440 
atgtagcatt tgttaggagt taaaagagag aattatttct ttcatcagag caaattatag 1500 
tggaaaaaaa aatcacttgt ttctgtcagt aaeacaaatg atgtattcag tgaataaaag 1560 
aatccctttt ataaaatcta tttttcttta aatcttggaa aaatgttgtt ttagctcaga 1620 
gtgatttcaa agtggaatgc aacagtagtc aagacttgtg tactataaat ccttttctga 1680 
ttcctta 1687 

<210> 57 

<211> 1262 

<212> DNA 

<213> Homo sapiens 

<220> " ~ " 

<221> misc^feature 

<223> Incyte ID No: 1604828CB1 

<400> 57 

ggacgccgcg cctcctctcg ggcggagcgg cgcggtggct gatcagagcg cgtagggctt 60 

cgccggtgcc gggcgctggg cgcggtcctg ctcagcccag ctcaccgcgc gccggccctc 120 

ggcgccctgg ttctgcggat caggagaaaa taatgaatgt cagaggaaaa gtaattctgt 180 

caatgctggt tgtctcaact gtgatcattg tgttttggga atttatcaac agcacagaag 240 

actctttctt gtggatatat cactcaaaaa acccagaagt tgatgacagc agtgctcaga 300 

agggctggtg gtttctgagc tggtttaaca atgggatcca caattatcaa caaggggaag 360 

aagacataga caaagaaaaa ggaagagagg agaccaaagg aaggaaaatg acacaacaga 420 

gcttcggcta tgggactggt ttaatccaaa cttgaaggaa tccgaataac taaactggac 480 

tctggttttc tgactcagtc cttctagaag acctggactg agagatcatg cggttaagga 540 

gtgtgtaaca ggcggaccac ctgttgggac tgcgagattc tcaaggggaa ggactgggtc 600 

tcatttctcc catctcagcg cttagcagga tgacctggta tagagcaggg aactgggaaa 660 

tgtgggtcag gggatcagac actccagttg ggtcttttat ataaattaaa tggcaaaagg 720 

ctccataccc ttctccttct ttcctaccct ccactttatc tgcaaaatgg gaatgatgat 780 

aacacccact tcatagaatg gtcatgaaga tcaaatgaga gaataaaagt caagcactta 840 
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gcctctggtg cacaataagt attaaataag 
aataatatta ccaaatgtcc agcttataca 
tagagctact aaaagatctt tgacaagcta 
aacaagagag ttttaagttc agaaatggtt 
ttttggtttt tggtttctgg tttacaatct 
aactaaaagc accatgtaag ctactaaaaa 
tctgaattga caaaaatgcc aatgcaaata 



tatacctatt cctccttttc cttttttaaa 900 
catttacaag acttagctag tgggctatgt 960 
aaactaagat gcaatgaatg aggtgtaacg 1020 
acagaagtat aagacagctg tgtgggtgtt 1080 
cgtcattcaa caaagatggg agttttatag 1140 
caacaacaaa aaaggctcat catttctcag 1200 
aaaatgatta ctttttattt taaaaaaaaa 1260 

1262 



<210> 58 

<211> 1330 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1644023CB1 

<400> 58 

gcgaagcggg cgtgcgccag tgcagtgccg cgggccgcgc gcgactcccg atagggaagg 60 
gccggagctg cgcaggggcg cgcggaagag gccaagctag ccggaacccc gccccgcccc 120 
gccccgcccc aacgggggct ggaacccggc gccgagagta gagaaaaggg gcctctggtg 180 
accgccccta cctggcatcc ctctaaccca ggaggagcgt ggggaaaggg gctgtgggcc 240 
tctcggggag cgagctgcgg gtagcggcgc actgggtaca ggcgcgcgct tggctgtcgc 300 
ctctgccgct gtgtttggga ggactcgaac tggcgccagg aaatattagg aagctgtgat 360 
tttcaaagct aattatgaaa acatttatca ttggaatcag tggtgtgaca aacagtggca 420 
aaacaacact ggctaagaat ttgcagaaac acctcccaaa ttgcagtgtc atatctcagg 480 
atgatttctt caagccagag tctgagatag agacagataa aaatggattt ttgcagtacg 540 
atgtgcttga agcacttaac atggaaaaaa tgatgtcagc catttcctgc tggatggaaa 600 
gcgcaagaca ctctgtggta tcaacagacc aggaaagtgc tgaggaaatt cccattttaa 660 
tcatcgaagg ttttcttctt tttaattata agccccttga cactatatgg aatagaagct 720 
atttcctgac gattccatat gaagaatgta aaaggaggag gagtacaagg gtctatcagc 780 
ctccagactc tccgggatac tttgatggcc atgtgtggcc catgtatcta aagtacagac 840 
aagaaatgca ggacatcaca tgggaagttg tgtacctgga tggaacaaaa tctgaagagg 900 
acctcttttt gcaagtatat gaagatctaa tacaagaact agcaaagcaa aagtgtttgc 960 
aagtgacagc ataaagacgg aacacaacaa atccttcctg aagtgaatta ggaaactcca 1020 
aggagtaatt taagaacctt caccaagata caatgtatac tgtggtacaa tgacagccat 1080 
tgtttcatat gtttgatttt tattgcacat ggttttccca acatgtggaa caataaatat 1140 
ccatgccaat ggacaggact gtaccttagc aagttgctcc ctctccaggg agcgcataga 1200 
tacagcagag ctcacagtga gtcagaaagt ctccactttc tgaacatagc tctataacaa 1260 
tgattgtcaa acttttctaa ctggagctca gagtaagaaa taaagattac atcacaatcc 1320 
aaaaaaaaaa 133 0 

<210> 59 

<211> 1110 

<212> DNA 

<213> Homo sapiens 

<220> _ ... . - - - - ~ 

<221> misc_feature 

<223> Incyte ID No: 1723402CB1 

<400> 59 

taagctgaat cgactgctgc caaacatcta ttaggcaaaa ttggcctctt gcccatgatt 60 
tgactttcca gcacagccag ttcttttcct cctctgcagc tgattggctc tggagtgtgg 120 
ccagaagcct ctctcctgca attaaaggag tcgggtctct aactgttgat ctgtitttttt 180 
cccttctgag caatggagct taccatcttt atcctgagac tggccattta catcctgaca 240 
tttcccttgt acctgctgaa ctttctgggc ttgtggagct ggatatgcaa aaaatggttc 300 
ccctacttct tggtgaggtt cactgtgata tacaacgaac agatggcaag caagaagcgg 360 
gagctcttca gtaacctgca ggagtttgcg ggcccctccg ggaaactctc cctgctggaa 420 
gtgggctgtg gcacgggggc caacttcaag ttctacccac ctgggtgcag ggtgacctgt 480 
attgacccca accccaactt tgagaagttt ttgatcaaga gcattgcaga gaaccgacac 540 
ctgcagtttg agcgctttgt ggtagctgcc ggggagaaca tgcaccaggt ggctgatggc 600 
tctgtggatg tggtggtctg caccctggtg ctgtgctctg tgaagaacca ggagcggatt 660 
ctccgcgagg tgtgcagagt gctgagaccg ggaggggctt tctatttcat ggagcatgtg 720 
gcagctgagt gttcgacttg gaattacttc tggcaacaag tcctggatcc tgcctggcac 780 
cttctgtttg atgggtgcaa cctgaccaga gagagctgga aggccctgga gcgggccagc 840 
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ttctctaagc tgaagctgca gcacatccag gccccactgt cctgggagtt ggtgcgccct 900 
catatctatg gatatgctgt gaaatagtgt gagctggcag ttaagagctg aatggctcaa 960 
agaatttaaa gcttcagttt tacatttaaa atgctaagtg ggagaagaga aacctttttt 1020 
ttggggggcg gtttttttgg tttgttgttg gttttttttt tttttttggc ctgggtgaca 1080 
agagcaagac tccgtctcaa aaaaaaaaaa 1110 

<210> 60 
<211> 1153 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<223> Incyte ID No: 



1740585CB1 



<400> 60 

ggtagagcgg 

cgagaacatc 

tggcctcccc 

ggagcagggt 

cgccggggca 

gaagtacctg 

caacaggac t 

attttctgca 

acctttgttt 

ccagctggct 

gacctgttta 

gatgcgaaga 

cacaattccg 

cactgctttg 

cagaattatt 

atctcagaca 

agcagaagaa 

aaaatatctg 

tgagaaagat 

caaaaaaaaa 



agacgacgct 
agaggaaccg 
gcggcaggaa 
tgggccatcg 
agagcgcgac 
gaatacttct 
gaagccatgt 
gcatttgagg 
ttggttttgg 
cggaaatatt 
cagaggaatg 
aagctagaaa 
agtccagcat 
gaaaatccag 
tgttcaacta 
atgaaggaag 
cttaacaagc 
tgctttcagc 
aatattgtac 
aaa 



cccagactcc 
gcagcgacgg 
aatcgacttt 
gtgttgtcgc 
cggcgccatc 
tgatggctgt 
gggaagattt 
cccagtcttg 
atgacaattt 
cattgggctt 
gccagagacc 
agcccaaccc 
gtgcttcgga 
taaaatatgc 
acattcttca 
caaaagatga 
tcaaagcaga 
aaaccattga 
agaagtattt 



tccggtctcc 
gccgcggaaa 
cgcgcgcgcc 
gtatgatgac 
ccaatggaaa 
cattaatggg 
tataacctgc 
ctacctctta 
- ttattatcag 
ttgccagctc 
acaggcactg 
tgagaaaaat 
ggccagcctg 
tgaggacaat 
taaaactgat 
acaagtgctt 
gtttttggaa 
cataccagat 
ttcaaagcag 



ccgggcagca 
cgaggcctct 
ctcgcccacc 
gtcatgcccg 
ttgcttcgac 
tgtcagatgt 
ttaaaggatc 
acaaaaactg 
agtatgagat 
tttttagatt 
cctcctgaga 
gcttgggaac 
gaagtgactg 
atggaacaaa 
cagacactcc 
cctcacaact 
gacctaaaac 
gtcatttctt 
cattaaaatt 



tgaagaccgc 
gcgtcctctg 
ggctgcagca 
acgcgtttct 
aggaactgtt 
ctotcccacc 
aagatctgat 
ctgtttctag 
atgaagtcta 
gtcctcttga 
ccatccacct 
acaacagcct 
atttattgct 
aggacacaga 
gaaggattgt 
tgaagcttct 
aaggaaacaa 
tttttcatta 
tctgaactgc 



60 

120 ■ 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1153 



<210> 61 

<211> 1955 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_ feature 
<223> Incyte ID No: 



1810925CB1 



<400> 61 

gggcaggcta 

gtgtatcctc 

atctgaagtg" 

ggatccaatt 

aggtctcaag 

tggccctata 

tgtatcacat 

aatatcagat 

acacaaacgt 

attgtgagat 

gtgaggggat 

tcctcttggt 

gctgggctga 

atgcccagct 

atgagaggag 

tcggcccaga 

tggcctacag 

acatgaaata 

aagactcccg 

aatggacttg 



agagatcttc 
tgcctttttt 
gage ttc tag 
gtgtccattg 
gtctgaagca 
caaatccttg 
caatcaeggg 
ggttattatc 
atategtatg 
cgtaaactgg 
aagagtggct 
tacggaagat 
gacaggccac 
ggatgtctct 
cctgcattac 
£T<J<jagtcccg 
catggaaaaa 
tetgegtcag 
cagatttgtg 
ctacccacag 



ttttaattca 
cttatctatt 
tatccccagg 
atctctcaga 
tcccacagaa 
gecaggagea 
aaccccttca 
cggcctcaaa 
ttctccaaag 
aaacagagac 
acattttcaa 
attctgaagc 
ccagatgatg 
tattctgggc 
ateaageget 
gtctcatggg 
taccgaagga 
aagaatattc 
gggcagtttg 
aatctagttc 



gectgettaa 

ggaggaagct- 

agegegaagt 

gtggctgagg 

tgatcctact 

tccctgagtc 

acatggaggt 

agcaggagat 

agectcaaaa 

tccaaatcca 

agtcagtgaa 

tcaatgcctc 

aatttgaaag 

tggtaaatga 

gcatagaaga 

taaccatgga 

caggcaacat 

cattttacat 

gtttctttga 

cattttagac 



gaegggaact 
cagatggtgt 
gaacacggaa 
ataatagagt 
gaataactcc 
cctgaaggtg 
gctggtggat 
gactgatgac 
atcagaagaa 
aggtcttcaa 
agtagagcat 
cagtaaaagc 
tgaaactccc 
caactggaag 
cctgccagca 
cccttcttgt 
ggcacgagtg 
ctctgtgttg 
ggcctcctgt 
aatgaagctg 



gataactgta 
-cacaagaagg 
ggtacctgea 
ttcttcttca 
cataagctgc 
tatggctctg 
tcctggcctg 
atggattcat 
gttttgaaaa 
gaaagtttag 
tcgagagcac 
aagcttggaa 
aactttaagt 
cgagggaaga 
gcctgtatgc 
gaagtaggaa 
atggtgcgat 
gaagaaaatg 
gagtggcacc 
cttagtaatc 



60 

120- 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 
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tctgccaagc 
aatgcatatt 
aaagacacag 
ttctttaaat 
gaaagccagg 
. gaatctacta 
aatctgtatg 
atggtacact 
agacgaatgg 
atcaaatgga 
ctcctactct 
tgcgagtctg 
ggggaattcc 



catctcttaa 
gggcacttat 
ccatgctctt 
atgcttaagt 
ctcagttctt 
tctggaagat 
gatactttat 
aaagaatgga 
ctttcccagg 
ctggccttgg 
ttctcacatc 
gttgaaagtt 
ggggaaaatt 



tattaaagca 
aatacagcag 
gaggagctta 
gttataggga 
acctctggga 
aaatgaagga 
caaaaatgaa 
atcagttcag 
ctaccttcca 
gggtctctgc 
ctcttcatga 
cagcctttgg 
ttgcccctaa 



gacaccacag 
gaactcttct 
caatcctggc 
aagacggggt 
atcagaactc 
ttttaataaa 
tgtccctgct 
tgagtagaaa 
tcactgttgt 
tatggaaatg 
ttgacgcatg 
tgtttgcaca 

ggttg 



aatagatttc 
cacctggagc 
tggaggcagg 
taccagtaaa 
tttatgcaac 
attttcaata 
atttctggat 
ggtatccaag 
acagaaaaga 
ccattttttg 
gcacaaggca 
actgctaaca 



ttcacttaca 
cttgatgtta 
ggagggtata 
catgtaacta 
ttggttaata 
gaataaacct 
ttatgaggca 
gtgaagcctg 
aatccagaga 
tgtctccttt 
aggtgtggcc 
aaaggggaca 



<210> 62 
<211> 1434 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 



1915064CB1 



<400> 62 

tcgggcggtg 

ggcggcgtca 

ccgagcggcc 

gagccgcagt 

gactttctca 

tgcagccaga 

gagtgcgccc 

ctgagcggag 

tgtcgtcttt 

gaaattgtac 

aacgcacggg 

cagctgaagc 

ctagtggcca 

gtggggctga 

ctgctccttg 

gtatttggag 

ctttaaacag 

agactgagct 

cgcaggtcta 

accagtttgg 

tagccagttt 

taatttgggg 

gctttgtgtt 

ttgtatataa 



ccccgggctg 
tgtcctccga 
tgcgcatggt 
gggtcccgga 
ccagaaagca 
aggtgccgct 
tggtgtccct 
ccaccttcct 
ccaataacca 
acatttccac 
ccacaggcat 
tgacagtggt 
tgcacaaggc 
gcttggagta 
cgtaccacag 
aaacaggagt 
cttcatgttt 
acactactgc 
ctagaaggt t 
gcagtaggaa 
acagcacttg 
cttctgggct 
ttgtactttt 
agcttatgat 



ggcgaggggc 
ggtgtccgcg 
gcccgaacac 
caaggagtgt 
ccactgtcgc 
gcggcgcatg 
caaggaggcg 
cgtcacgttt 
gagatacttg 
cgtgcagatc 
gttcctgcag 
ggaggacgtg 
tgccaagctc 
cgtgtggtca 
ggattaatcc 
gttcacttat 
tagaatttgt 
taaactattt 
ctggcccatc 
ctcaggcttc 
ccttagcctg 
gtgagtgatc 
cactcactat 
taaaaactat 



cgggtgcggg 
cgccgcgacg 
cgcgccttcg 
cggagatgta 
cgctgcggga 
tgctttgtgg 
gagttctacg 
ggaaactcag 
tttctggatg 
ctcacagaag 
tatacagtgc 
actgtgggca 
ctctatgaat 
ccaggactga 
tgcttgtgct 
ctagtgcaat 
gtattgtcca 
ttagcataat 
aatattcatt 
tggtctgcag 
tttcacagac 
ctttgatact 
ttcactttat 
tttgaacata 



<210> 63 
<211> 1360 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 



2185608CB1 



<400> 63 

ccgataacga 

acccaggctt 

tccagggccc 

atgccagtgg 

ggttgccgca 

atccagaaag 

agcggcggtt 

cgcttccgct 



gccgctggcc 
ccaagaagct 
gaagcccgtt 
tgcagtgtga 
agtgcttctg 
accccgtgcg 
acaagcagct 
agaaacctga 
gagacagcca 
gcttccctcc 
cggggacgga 
ggaggcaggc 
ctcgggacca 
gtcgcttgga 
gggaaatgca 
atgttcacag 
tacttaattg 
atataccatt 
tcatttaatt 
tggagcctgt 
ttgtccactt 
tcaccaaggg 
taagatgact 
aaaaaaaaaa 



gagaggctga 
ggtgcgctcc 
cggcctggag 
cgccaagttt 
cgacaggtgc 
gcagtgcgcg 
caaagtgctc 
aactatgact 
ctatgaaatc 
tggaggaggc 
gggtgtgacc 
ggtggcgtgg 
gtaactctac 
acagcagagc 
actcactcat 
tttattaatg 
ggggtgggag 
tttatgagtt 
cttccacaga 
tcgcctctaa 
accttgtcac 
gaacgtgggg 
gtacagcaat 
aaaa 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1955 



60 

120 

180 

240 

300 

3 60 

420 

480 

540 

600 

660 

720 

780 

840 

900. 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1434 



acgagactct 
ccctggagtc 
gggcaggcga 
gcacgcaggc 
tctgcctggg 
ccaacggtct 
tccagatggt 
ccccctacga 



ggcatgctaa 
ggccccacgg 
gctgtggctg 
caccatgaag 
caatacctac 
ccacggcttc 
gtcgctggtg 
cggcaatgag 



ctagtgacag 
atcatgcggc 
ccgcatggga 
ggcatcacga 
catctgggtc 
atgaattggc 
tctctgtccg 
accctgctga 



tcaagatggc 
tggtggccga 
cagtggccac 
ccgaacagct 
taaggccggg 
ctcataatct 
aggtgacgga 
gcccggagaa 



gggagcagct 60 
atgcagccgc 120 
tcctgtgttc 180 
ggacgctctg 240 
acccgagctg 300 
gctaacggac 360 
ggagggcgtc 420 
atccgtgcag 480 
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atccagaatg cgctgggctc ggacatcatc atgcagctgg acgacgtggt tagcagtact 540 
gtgactgggc cacgtgtgga ggaggccatg tacaggtcaa tccgctggct ggaccggtgc 600 
attgcagccc atcagcggcc ggacaagcag aacctcttcg ccattatcca gggtgggctg 660 
gacgcagatc tccgggccac ctgccttgaa gagatgacca agcgagacgt gcctggcttc 720 
gccatcgggg gcctgagcgg gggtgagagc aagtcgcagt tctggcggat ggtggcgctg 780 
agcacctctc ggctgccgaa ggacaagccc cgatatctga tgggggttgg ctatgccact 840 
gatctggtag tctgcgtggc tcttggatgt gacatgttcg actgcgtctt ccccacacgg 900 
acagcgcgct ttggctctgc cctggtgccc actgggaacc tgcagttgag gaagaaggtg 960 
tttgagaagg acttcggccc catagacccg gagtgcacct gccccacgtg ccaaaagcac 1020 
agccgcgcct tcctgcacgc actgctgcac agtgacaaca cggccgcgct gcaccacctc 1080 
acggtccaca acatcgccta ccagctgcag ctcatgagcg ccgtccgcac cagcatcgtg 1140 
gagaagcgct tcccggactt cgtgcgggac ttcatgggcg ccatgtacgg ggatcccacc 1200 
ctctgtccca cctgggccac tgacgctctg gcctctgtgg gaatcacact gggctgacct 1260 
ggcattggga gagggaggga ggaaggaagg gagggagggg ctggaagata ctgaaggatt 1320 
cctttttgaa aggttttttt tattgtaact taaaaaaaaa 1360 

<210> 64 

<211> 2277 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2228862CB1 

<400> 64 

aaacgccatg cggaggggcg agcgcaggga cgccggacgt ccgcggcccg agtccccggt 60 
gcccgcgggc agggcctcgc tggaggagcc gcctgacggg ccgtctgccg gccaagccac 120 
cgggccgggc gagggccgcc gcagcaccga gtccgaggtc tacgacgacg gcaccaacac 180 
cttcttctgg cgagcccaca ccttaaccgt gctcttcatc ctcacctgta cgcttggcta 240 
tgtgacgctg ctggaggaaa cacctcagga cacggcctac aacaccaaga gaggtattgt 300 
ggccagtatt ttggttttct tatgttttgg agtcacacaa gctaaagacg ggccattttc 360 
cagacctcat ccagcttact ggaggttttg gctctgcgtg agtgtggtct acgagctgtt 420 
tctcatcttt atactcttcc agactgtcca ggacggccgg cagtttctaa agtatgttga 480 
ccccaagctg ggagtcccac tgccagagag agactacggg ggaaactgcc tcatctacga 540 
cccagacaat gagactgacc cctttcacaa catctgggac aagttggatg gctttgttcc 600 
cgcgcacttt cttggctggt acctgaagac cctgatgatc cgagactggt ggatgtgcat 660 
gatcatcagc gtgatgttcg agttcctgga gtacagcctg gagcaccagc tgcccaactt 720 
cagcgagtgc tggtgggatc actggatcat ggacgtgctc gtctgcaacg ggctgggcat 780 
ctactgcggc atgaagaccc ttgagtggct gtccctgaag acgtacaagt ggcagggcct 840 
ctggaacatt ccgacctaca agggcaagat gaagaggatc gccttccagt tcacgccgta 900 
cagctgggtt cgcttcgagt ggaagccggc ctccagcctg cgtcgctggc tggccgtgtg 960 
cggcatcatc ctggtgttcc tgttggcaga actgaacacg ttctacctga agtttgtgct 1020' 
gtggatgccc ccggagcact acctggtcct cctgcggctc gtcttcttcg tgaacgtggg 1080 
tggcgtggcc atgcgtgaga tctacgactt catggatgac ccgaagcccc acaagaagct 1140 
gggcccgcag gcctggctgg tggcggccat cacggccacg gagctgctca tcgtggtgaa 1200 
gtacgacccc cacacgctca ccctgtccct gcccttctac atctcccagt gctggaccct 1260 
cggctccgtc ctggcgctca cctggaccgt ctggcgcttc ttcctgcggg acatcacatt 1320 
_ gaggtacaag-gagacccggt ggcagaagtg gcagaacaag gatgaccagg gcagcaccgf 1380 
cggcaacggg gaccagcacc cactggggct ggacgaagac ctgctggggc ctggggtggc 1440 
cgagggcgag ggagcaccaa ctccaaactg acctgggccg tggctgcctc gtgagcctcc 1500 
cagagcccag gcctccgtgg cctcctcctg tgtgagtccc accaggagcc acgtgcccgg 1560 
ccttgccctc aaggtttttt gcttttctcc tgtgcacctg gcgaggctga aggcgagggg 1620 
tggaggaggc cccagcacag cctcatctcc atgtgtacac gtgtgtacgt gtgtatgcgt 1680 
gtgtgtacgc gtgtgtacgc gcgtgtgtac acatgcgtgg ccgcctgtgg tgtgcacgtg 1740 
tgctctgggc tccgaggctt ctccagagct gggagctggc tggcgtggca agggcatgct 1800 
ctggggcagt gtgtccctca ggaaccaggg tcctccctcc cctttctgcc tggtcagccc 1860 
cgtggcctct ggcccaccaa gctccctgtc acccagccat ggtgtggtcc aggcagggac 1920 
atctcggtac cctttctgca ctccgtgggc cctgggtgcg ctgaggcctg gaggcgtcta 1980 
cactggctcc acatccactt cccccgcagc tcgtgtgggc gctcgtccac aaacactccg 2040 
tggctgagag gcagcggatc caggcagcga tgctgagcca cctcctccga gccttccttt 2100 
cacacagacc accccggagg acacgtggat gatggggtca gagatcactg agctgcccct 2160 
caagggggcc tggaacccgg gtgctggggt catgctgcct ccgtggctcc aaggtgaggg 2220 
tcatcttcac gagcaaagag aaccaataaa gtgacaacga acgtcaaaaa aaaaaaa 2277 

<210> 65 
<211> 1592 
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<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2235577CB1 

<400> 65 

ccaagatggc ggcccccggg agctgtgctc tatggagcta ttgcggccgt gggtggtcgc 60 
gggcgatgcg gggctgccag ctcctcgggc ttcgtagctc ttggcccggg gacctactaa 120 
gtgctcggct cttgtcccaa gagaagcggg cagcggaaac gcactttggg tttgagactg 180 
tgtcggaaga ggagaagggg ggcaaagaac tcctttaatg tgaaagtttt tgctggcatc 240 
acttcttctg ggacatcttc aacctttttg tcacagctcc atttcttgtt catgttgtct 300 
atcaggtgtt tgaaagtgtg gctaagaagt atgatgtgat gaatgatatg atgagtcttg 360 
gtatccatcg tgtttggaag gatttgctgc tctggaagat gcacccgctt cctgggaccc 420 
agctgcttga tgttgctgga ggcacaggtg acattgcatt ccggttcctt aattatgttc 480 
agtcccagca tcagagaaaa cagaagaggc agttaagggc ccaacaaaat ttatcctggg 540 
aagaaattgc caaagagtac cagaatgaag aagattcctt gggcgggtct cgtgtcgtgg 600 
tgtgtgacat caacaaggag atgctaaagg ttggaaagca gaaagccttg gctcaaggat 660 
acagagctgg acttgcatgg gtattaggag atgctgaaga actgcccttt gatgatgaca 720 
agtttgatat ttacaccatt gcctttggga tccggaatgt cacacacatt gatcaggcac 780 
tccaggaagc tcatcgggtg ctgaaaccag gaggacggtt tctctgtctg gaatttagcc 840 
aagtgaacaa tcccctcata tccaggcttt atgatctata tagcttccag gtcatccctg 900 
tcctgggaga ggtcatcgct ggagactgga agtcctatca gtaccttgta gagagtatcc 960 
gaaggtttcc gtctcaggaa gagttcaagg acatgataga agatgcaggc tttcacaagg 1020 
tgacttacga aagtctaaca tcaggcattg tggccattca ttctggcttc aaactttaat 1080 
tcctttccta tcatggagca tgaaccagtc atatcctgtt gaaagcctgg aactgaagga 1140 
taatctggca aatgagacag cagcagagca tctcctctta aggatacgtg ccttggactc 1200 
atgtttgaat cgaacagtct caaagtggaa gaacaaattc ttgtcacttt tttacagctt 1260 
tctttggagc tgcttcagtc catctcccag aggcatttgg tctgtatctt tgctcaactg 1320 
ctaatttctc ttggctgtag ggtgtgtggt taaggtacaa ccacccctaa agctcagttt 1380 
tgaagtgagt gtatttatag cttctctgct ggtgctgcct tctagaggga tgatagatca 1440 
tttgaaccca atgacaattt ttaaccagaa aatttaattg tacctgaatc aacctttcag 1500 
cctaggacga agtctaggcc caagtcagag tattaatgat catgagaatt gtgtgctgaa 1560 
ccagtaaacg agtttacctt ttaaaaaaaa aa 1592 

<210> 66 

<211> 1390 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2271680CB1 

<400> 66 » 

gatcgtttgt cgcgatgtgg agtggccgta agctgggctc ctccgggggt tggtttttaa 60 
gagtgctggg gcctggaggc tgtaatacaa aagctgcgcg tcccttaatt tcctcggcgg 120 
tttatgtgaa gaaccagctc agtgggactc _tacagattaa accaggggtt ttcaatgaat- 180 - 
" acagaaccat atggttcaaa tcctacagga cgatcttttc ctgtttgaac agaataaaga 240 
gtttcaggtg gagtttcact tctgttgccc aggctggagt acaatggtgc gatcttggct 300 
cactgcaacc tccacctccc gggttcaagc gattctcctg cctgagcctc ctgagtcatt 360 
gggattacag gtacccttgg gcgagactgt acagtacttc ccaaaccact gtcgacagcg 420 
gtgaggtaaa aaccttcttg gccctggctc acaaatggtg ggatgaacaa ggagtatatg 480 
cacctcttca ttccatgaat gacctgaggg tgccatttat tagggacaat cttctgaaaa 540 
caattcctaa tcaccagcca ggaaaacctt tgttggggat gaagattctt gacgttggct 600 
gtggtggtgg gctgttaact gaacctctag ggcggcttgg ggcttcagtt attggaatcg 660 
accctgtgga tgagaacatt aaaacagcac aatgccataa atcatttgat ccagtcctgg 720 
ataagagaat agagtacaga gtgtgttccc tggaagagat tgtggaagag actgcagaaa 780 
catttgatgc tgttgtagct tctgaagttg tagaacatgt gattgatcta gaaacatttt 840 
tacagtgctg ctgtcaagtg ttaaaacccg gtggttcttt attcattact acaatcaaca 900 
aaacacaact ttcctatgcc ttgggaattg ttttttcaga gcaaattgca ggtattgtac 960 
caaaaggtac tcatacatgg gagaagtttg tttcacctga aacactagag agcattctgg 1020 
aatcaaatgg tctgtcagtt caaacagtgg taggaatgct ctataacccc ttctcaggtt 1080 
actggcattg gagtgaaaat accagcctta actatgcagc tcatgctgtg aaatccaggg 1140 
tccaggaaca cccagcctct gctgagtttg ttttaaaggg agaaacagaa gagctccaag 1200 
ctaatgcctg caccaatcca gctgtgcatg aaaagctgaa gaaatgaatt gtttctgaga 1260 
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actatagtaa tatggcttgg atatctgatg ttttcaaata caagaaatgt acaatttatc 1320 
ctttgagaga gaatcatgaa gaaaagaagg tcaataaaaa gggctaaaac cttggaaaaa 1380 
aaaaaaaaaa 13 90 

<210> 67 
<211> 899 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2325603CB1 

<400> 67 

ttcattggcg ggtccgccgc ggtcgaggat cctggaggtc ttaaccatga acttctctgg 60 
aggagggagg caggaagcag cagggtccag gagtagaagg gctcccagac cccgagaaca 120 
ggaccgagac gtgcagctgt ccaaggctct gtcctatgcc ctgcgccatg gggccttgaa 180 
gctggggctt cccatgggag ctgatggctt cgtgcccctg ggcaccctcc tgcagttgcc 240 
ccagttccgc ggcttctctg ctgaagatgt gcagcgcgtg gtggacacca ataggaagca 300 
gcggttcgcc ctgcagctgg gggatcccag cactggcctt ctcatccggg ccaaccaggg 360 
ccattccctg caggtaccta agttggagct gatgcccctg gagacaccgc aggccctgcc 420 
cccgatgcta gtccatggta cattctggaa gcactggcca tccatcctac tcaaaggcct 480 
gtcctgccag ggaaggacgc acattcacct ggccccagga ctgcctggag accccggtat 540 
catcagtggc atgcggtccc attgtgaaat agctgtgttc atcgatggac ccctggctct 600 
ggcagatgga atacccttct tccgctctgc caatggggtg attctgactc cagggaatac 660 
tgatggcttc ctccttccca agtacttcaa ggaggccctg cagctacgcc ctacccgaaa 720 
gcccctttcc ttggctggtg atgaagagac agagtgtcag agtagcccca agcacagctc 780 
cagagaaagg aggaggatcc aacaataaaa tattaattta taaaaaagaa attttaaaaa 840 
gtaacaagaa agaactcgtt tgaaaccatg tttcatcatc ctgtaaaaaa aaaaaaaaa 89 9 

<210> 68 

<211> 1041 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 2356055CB1 

<400> 68 

gacgtaggac ctgatagcaa ggagggggca catttcaggg acaccatgaa gggtggcttc 60 
actgggggtg atgagtacca gaagcacttc ctgcccaggg actacttggc tacttactac 120 
agcttcaatg gcagcccctc acccgaggcc gagatgctga agtttaactt ggaatgtctc 180 
cacaagacct tcggccctgg aggcctccaa ggggacacgc tgattgacat tggctcaggt 240 
cctaccatct accaagttct tgctgcctgt gattccttcc aagacatcac tctctccgac 300 
tttaccgacc gcaaccggga ggagctggaa aagtggctga agaaggagcc gggggcctat 3 60 
gactggaccc cagcggtgaa attcgcctgt gagctggaag gaaacagcgg ccgatgggag 420 
gagaaggagg agaagctgcg ggcagcggtg aagcgggtgc tcaagtgcga tgtccacctg 480 
ggcaacccgc tggccccggc __tgtg.ttgc.ct_ ctcgccgact-gtgtgctcac -cctgctggcc 540 
-atggagtgtg'cctgctgtag ccttgatgcc taccgcgctg ccctgtgcaa ccttgcctca 600 
ctgctcaagc cgggtggcca cctggtgacc actgtcacgc ttcggctccc gtcctacgtg 660 
gtggggaagc gtgaattttc ctgcgtggcc ctggagaaag aggaggtggc agccaggcaa 720 
tgtccaggag aggagattgc caaggaaagg aggctacaaa tgccccctcc ttgtgatgtc 780 
aggacctccc ttagcgagcg atctggccaa gacacaggga aaagacacag gatccagacc 840 
cggggctctg ctccttggac ggctcagtgc agagagtcag ctggctgcct ggaaggagag 900 
agtcggcaag ggtgtgaggg aatctttggg tgctgtggaa "gctgfctctac cttatgaaat 960 
ggggctggga tggactgagt gactatgctg tgctctgtca tttgtccgta agtactcgct 1020 
gcatactctg atgcgctgat g 1041 

<210> 69 
<211> 1106 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2448909CB1 
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<400> 69 

acggccagtg caagctaaaa ttaaccctca 
ctagaactgt atttcagaaa aaagaaacta 
aacaagccgg atcagaagac gaaaactctg 
gagccacaag tctgaattta tagagctgag 
ttcaaactta gcgcctgctt gttttccagg 
cctgcaggag ggacagatga ttatttcgtt 
agtgattcga agctacttag gggcatacat 
gctggcgctg tgcacctttt tagtttcaga 
gccttacctg gagattttac ccaaggcgta 
ggtgaacctt cttcccaaat ctttaaaagc 
ggagttcttt gcttcctcca gagacttttt 
tgttgacagc atcttcagct acagtgccct 
agccgtgtac ctgaggccca ggcagcggga 
actcgctccg tacctggacc tgctgaatca 
taatgaagaa actcattctt acgaaattag 
ggtattcatc tgttacggcc ctcacgataa 
ttctgtccat aatcctcatg cttgtgttta 
ttaacattaa cactatataa tttttttccc 
aaaaatatca aaaccttaaa aaaaaa 

<210> 70 
<211> 2405 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2631212CB1 



ctaaagggaa taagcttgcg gccgccgcgc 60 
cagttttagc atgcagaaag gaaaagggag 120 
cggaagttct gaatcaagag gagtgaatga 180 
gaagtggctg aaagctagga agtttcaaga 240 
tacaggaaga gggctgatga gtcaaacatc 300 
gcctgagagt tgcctgctca ccacggacac 360 
tactaagtgg aagcctcctc catctcctct 420 
aaagcatgct gggcaccgat ctctttggaa 480 
tacctgccct gtttgtttgg agccggaagt 540 
aaaggctgaa gagcagagag cccacgtgca 600 
ctcttctctg cagcctctgt ttgcggaggc 660 
gctgtgggct tggtgcaccg tcaacaccag 720 
atgcctttct gcagagccgg acacctgtgc 780 
tagcccacat gtccaggtaa aagcagcgtt 840 
aacgacttca cgttggagaa agcatgaaga 900 
tcaacggctg ttcctggaat acggatttgt 960 
tgtctcaaga ggttggaatc aactttgttc 1020 
catttggaga tgtgtatttt cagttttaat 1080 

1106 



<400> 70 

cagcgggtct ggctggcggc agcggcggga gggagccgag agacccgagt gcacgtgtgg 60 
agaagcggcg gcacaagcgc ggcggcggga gacactcccg cccccaccag actcaagccc 120 
tcactcgact ctcgcggcct tcgttgctcg cacagctccc tgcccaggct aggaggccgg 180 
cttgcggggt tgagtggccc gagctaaggg tgcggagacc taagggcggc gactacgacg 240 
gcgttgatat cggtggtaac gacggcctca gcaggcgggg aagatgaaag gtagccggat 3 00 
cgagctggga gatgtgacac cacacaatat taaacagttg aaaagattga atcaggtcat 3 60 
ctttccagtc agctacaatg acaagttcta caaggatgtg ctggaggttg gcgagctagc 420 
aaaacttgcc tatttcaatg atattgctgt aggtgcagta tgctgtaggg tggatcattc 480 
acagaatcag aagagacttt acatcatgac actaggatgt ctggcacctt accgaaggct 540 
aggaatagga actaaaatgt taaatcatgt cttaaacatc tgtgaaaaag atggtacttt 600 
tgacaacatt tatctgcatg tccagatcag caatgagtcg gcaattgact tctacaggaa 660 
gtttggcttt gagattattg agacaaagaa gaactactat aagaggatag agcccgcaga 720 
tgctcatgtg ctgcagaaaa acctcaaagt tccttctggt cagaatgcag atgtgcaaaa 780. 
gacagacaac tgaacaaatt acaaatgaac tttcttgcac ttgcttgtcg ccaaataaaa 840 
gagaggccca ttgattcctc ccccacccca acacttttct tttaaagctt ttctccctcc 900 
ttgttcttgt ttttctttct tcctttcctt ttctctgaga gttttaatac tttcaaggac 960 
tttaaaaaaa taatcatgtt tgaattgttt tctcttattt ttgtgaggtg gtttgaagga 1020 
aggacaaggt agatctgttt agttttgcag ttgaagttag atggtcctaa- acatttaatt 1080 
gtcaaataatf ttcaaattta atgtcctgct ttcacattga agggcagagc ctacaaaaca 1140 
ttgtatattt caaaagacaa aaagaagcag cagcagtatc ttgttctcta attcatagac 1200 
aagttgagtg tgtttgtggt actttgggtt tttaaacact ttgggatact aatccctaga 1260 
cattgccttc actccacctt tagtccttct gagcactctc tcgggagttg gaacattgtt 1320 
atccttgtaa gaaatactaa gcttatgttg atttttaagt aattatatct tctcttcttg 1380 
ctggtgggtg gggcagtttg gtttagtgtt atactttggt ctaagtattt gagttaaact 1440 
gcttttttgc taatgagtgg gctggttgtt agcaggtttg tttttcctgc tgttgattgt 1500 
tactagtggc attaactttt agaatttggg ctggtgagat taattttttt taatatccca 1560 
gctagagata tggcctttaa ctgacctaaa gaggtgtgtt gtgatttaat tttttcccgt 1620 
tcctttttct tcagtaaacc caacaatagt ctaaccttaa aaattgagtt gatgtcctta 1680 
taggtcacta cccctaaata aacctgaagc aggtgttttc tcttggacat actaaaaaat 1740 
acctaaaagg aagcttagat gggctgtgac acaaaaaatt caattactgt catctaatgc 1800 
cagctgttaa aagtgtggcc actgagcatt tgattttata ggaaaaaata gtatttttga 1860 
gaataacata gctgtgctat tgcacatctg ttggaggaca tcccagattt gcttatactc 1920 
agtgcctgtg atattgagtt taaggatttg aggcaggggt aattattaaa catattgctt 1980 
ctattcttgg aaaaatagaa gtgtaaaatg ttaataatac aaatgtcact gtgacctcct 2040 
ccactgagag gactggttta tgccagatca ttttccggca cacacggagt ggctttgaca 2100 
gattgataac tttgtaagat gggagacatc tgaaatattc atgttttcct tttgtagtcc 2160 
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catctccact atttagaaat gttctcagac tttaaaataa tgcacagggc ttgagctttc 2220 
tgtcatttga ctttaaaagg aagtttcatt catatttatc ctcttatgta aaattgcggt 2280 
ataaagtctc atttccaaat atgttaaatg acaaaattat tttataaaat gtttatgcac 2340 
actttataac cttaagtttt tatttgagaa tgtgaaagta caaagtgcag tagacttcaa 2400 
caatc 2405 

<210> 71 

<211> 1239 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2678733CB1 

<400> 71 

ggcgatgagg gtgttggtgc 
tgggaggccg agtccccagt 
ctccagagtc cgcgagaagc 
ccgcgaggcg ctgcgggcgc 
caaactggag gtggtcacaa 
tgctgtgcag tctcagcttc 
tgttggagta gtggettcgt 
ctatggcata ttgaatgttc 
aatccataca gtgcttcacg 
taaaaggttt gatgtaggcc 
cactgcaaag gaattggaag 
tttgaaaaat ttgcctgaaa 
ttacgcccct aagatttctg 
acaaatattc agactttacc 
ggcgaatacc attaaacttc 
aaaattaacg ggacaggctc 
actattggtt tattgcaagg 
actaacagct actgacttct 
agctcaacca agccaatgca 
aaaaactgtt gctatgcaac 
ttacatattt gtaatttatt 

<210> 72 
<211> 2295 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 2768571CB1 

<400> 72 

aaactcagca cttgccggag tggctcattg _ttaagacaaa. gggtgtgcac-ttcctggcca-60- 

ggaaacctga gcggtgagac tcccagctgc ctacatcaag gccccaggac atgcagaacc 120 

ttcctctaga acccgaccca ccaccatgag gtcctgcctg tggagatgca ggcacctgag 180 

ccaaggcgtc cagtggtcct tgcttctggc tgtcctggtc ttctttctct tcgccttgcc 240 
ctcttttatt aaggagcctc aaacaaagcc ttccaggcat caacgcacag agaacattaa 300 

agaaaggtct ctacagtccc tggcaaagcc taagtcccag gcacccacaa gggcaaggag 360 

gacaaccatc tatgcagagc cagtgccaga gaacaatgcc ctcaacacac aaacccagcc 420 

caaggcccac accaccggag acagaggaaa ggaggccaac caggcaccgc cggaggagca 480 

ggacaaggtg ccccacacag cacagagggc agcatggaag agcccagaaa aagagaaaac 540 

catggtgaac acactgtcac ccagagggca agatgcaggg atggcctctg gcaggacaga 600 

ggcacaatca tggaagagcc aggacacaaa gacgacccaa ggaaatgggg gccagaccag 660 

gaagctgacg gcctccagga cggtgtcaga gaagcaccag ggcaaagcgg caaccacagc 720 

" caagacgctc attcccaaaa gtcagcacag aatgctggct cccacaggag cagtgtcaac 780 

aaggacgaga cagaaaggag tgaccacagc agtcatccca cctaaggaga agaaacctca 840 

ggccacccca ccccctgccc ctttccagag ccccacgacg cagagaaacc aaagactgaa 900 

ggccgccaac ttcaaatctg agcctcggtg ggattttgag gaaaaataca gcttcgaaat 960 

aggaggcctt cagacgactt gccctgactc tgtgaagatc aaagcctcca agtcgctgtg 1020 

gctccagaaa ctctttctgc ccaacctcac tctcttcctg gactccagac acttcaacca 1080 

gagtgagtgg gaccgcctgg aacactttgc accacccttt ggcttcatgg agctcaacta 1140 



ggcgctgttg gggtcctccg 
ggcgagcact ggcccgactc 
ctccctggcg ggtgctcttc 
tgcacgccgc cagggaaaac 
tgccttcccc atcaccaaaa 
ccgtatatga gtggccggat 
ctggccgact tttgaatgag 
atcccagttg cctcccgaga 
gagacacagt tactggagta 
caattctcaa acaagaaact 
cagtgttgtc aagactgggt 
gtctgagcaa tggaaggcag 
ctggtaccag ttgtataaaa 
gtgccattgg aaatataatt 
tggatttggt agaagttaac 
ttattccagg atcagtaata 
atggttggat tggtgttcga 
acaatggata tttgcacccc 
gatttcagac tctcagactt 
aatgcattga gtagttagga 
aaaaacctta tttacaagg 



ctggctcatg gcgccaggcg 60 
ggctgggagg actgccggga 120 
ttcggcacgg accagttcgc 180 
aaagaagaag agttaatcga 240 
ggactgccag tgaagcaata 300 
gtgggatctg gagaatatga 360 
gctcttattc ttaaatttcc 420 
tggcgtggcc cagcccctgt 480 
acaattatgc aaattagacc 540 
gttcctgtgc cacccaagag 600 
gccaacatgc tcatttcagt 660 
cagccaatgg agggggcgac 720 
tgggaggaac aaacttcaga 780 
ccgttgcaga cgctctggat 840 
agttcagtcc ttgctgatcc 900 
taccacaaac agtcacaaat 960 
tcagtgatgc tcaagaaatc 1020 
tggtaccaga aaaattccca 1080 
ccaacaaaga agaagcagaa 1140 
agaagatgga taaaaaccta 1200 

1239 
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ctccttggtg 
ggccagcctc 
catcctgaac 
gagcggagct 
ctttaccgcc 
cgtgcctctt 
gtggctggaa 
gcacagaccc 
cccagacttt 
ccactggagg 
ctgtgaccag 
ctactatgat 
gagagaagtc 
tcccggaact 
gctccaaggc 
gactaagctc 
ctcaagatgg 
tgaggccagg 
aaaaacactc 
aacatataga 



cagaaggtcg 
cccgctggga 
aactcccaca 
ctcattaaag 
ttctccctga 
gggaaggacg 
gcactgctta 
caggaagctt 
ctccgataca 
atataccgcc 
gtgagtgctt 
acatcatgga 
tggaagcggc 
gccaaagcca 
acaggataca 
caagcccttc 
caaatggcta 
gatttttaat 
ttccagtcca 
tctgg 



tgacacgctt 
gcctccggtg 
tgggccagga 
gctacgaaca 
cccagtcact 
tccgctactt 
tgaatcagac 
ttcgggaagc 
tgaagaacag 
ccaccactgg 
atggcttcat 
agcggctgat 
tacacgatga 
agaactgacc 
gtgggaatct 
aagagttcca 
attgaggttc 
taaatggggt 
aaagcttctt 



ccctccagtg 
catcacctgt 
gatagacagt 
ggatgtgggg 
ccttatattg 
gcacttcctg 
ggtgatgtca 
cctgcacatg 
gtttctgagg 
ggccctcctg 
cactgagggc 
cttttacata 
agggataatc 
ggggccaggg 
tgagactctt 
agggaacact 
tgaagttctt 
gatgggtggc 
gatacagaaa 



ccccagcagc 
gccgtggtgg 
cacgactacg 
actcggacat 
ggcaatcggg 
gaaggcaccc 
aaaaaccttt 
gacaggtacc 
tctaagaccc 
ctgctcactg 
catgagcgct 
aaccatgact 
cggctgtacc 
ctgccatggt 
tggccatttc 
tgaaccatgg 
cagtacattg 
caataccaca 
aaagagcctg 



agctgctcct 
gcaacggggg 
tgttccgatt 
ccttctacgg 
gtttcaagaa 
gggactatga 
tctggttcag 
tgttgctgca 
tggatggtgc 
cccttcagct 
tttctgatca 
tcaagctgga 
agcgtcctgg 
ctccttgcct 
ccatggctca 
acaagactct 
ctgtaggtcc 
attcctgctg 
gatttacaga 



1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2295 



<210> 73 

<211> 2182 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> incyte ID No: 3189062CB1 



<400> 73 

ccggcatggc 

gtgagccgac 

gcaacatgga 

ggtttaccca 

gtgtggaaaa 

agcagcacct 

agctcccgga 

gtcttgaggt 

ctgatttgcc 

tccaagccaa 

ttggcgtcca 

cagtgacact 

ataagtttga 

tggcatcgtt 

ttacattgcc 

cccatgctgt 

aaatacactt 

tggggctgat 

atgctggagg 

aggctcttgg 

ccaaccaatg 

ccaagccaga 

agcaggtgca 

atgctcccca 

gctagaggag 

cagttcagtc 

tagccctgaa 

agaaaatgaa 

gcaattatga 

cctcgggttt 

gcaactatac 

gctctggcta 

gtgaggaggc 

gtaatgggag 

ttaagtcctt 

aacaaaaatt 



cgggtgagct 
tctgaggaga 
gagagaaagt 
gcgatacaga 
gatgccggat 
gcaggagctg 
tcctgttcct 
gagtcgctgg 
ccgatcatgg 
gcagtggaaa 
gcgtttggca 
gctgctgggt 
cgtgacccag 
gtcctgtgct 
tttcctagtt 
agttgctctg 
tggagataac 
tcccagctct 
-cattttgcat 
agtcagcaaa 
gaaaaatgga 
gtggcaaagg 
tgggaaacca 
tgtggatcac 
gtagatcctg 
caggttgtca 
agcaggctct 
atacctgttt 
attccaaatt 
cgttatttgc 
acaggacatc 
ccatatagaa 
ttctgtggtt 
gtagattgat 
aacatgaagc 
attttagtgt 



gcaggcttcc 
tggagtatcg 
gggaagcccg 
gaatatctcc 
ggctcggtgg 
aggaatcgtg 
tcgaagaggg 
gtggtgggtc 
caacggcatg 
aatctgggac 
aaacgagggc 
gaccatggct 
tgtatgttct 
ggagaagtgc 
catgctggtg 
agaaataacc 
agaaaactga 
gaagaaggct 
"atccaccaaa 
gtagagaaag 
gctaccaggg 
tgggcagaat 
tggaagacac 
atagtcctgg 
ggacacatgg 
tcccttttgt 
agatcaattc 
gggagaagca 
ctgacctcag 
aaaatgagag 
ggttatttta 
aattgactga 
gtccaggtct 
atgataaata 
tgctttgtta 
atttccattc 



ttatttaaga 

ctgaggtgat 

tggctgttgt 

agaggcagaa 

cgctaccggt 

ttgccccagg 

cccagggttg 

ggggagtcaa 

gtaatctctt 

cggaactctg 

gggtatcacc 

gggtagagca 

cctttggaaa 

tggtggatct 

ctgccttcgt 

ttgagatcaa 

agctctcaaa 

ggcccattgc. 

atgtggaatc 

agcattggct 

attctagggg 

ctgcagaaac 

aaattctgca 

atctggaatg 

gatccacgtg 

cccctggtga 

aaattatttc 

gcatggccca 

ttctggaatt 

tttctttgaa 

gagtgaaaga 

aggagggcca 

gaggtgatgg 

aaattgacag 

ttatgactga 

ccttactgtg 



ccgggaattt 
gagagagaat 
cgcagttgtg 
actctttgat 
gctgggagag 
cagtccctgt 
ttcacctgcc 
gtggtcagcc 
gttgctgagt 
ggagaccgtt 
ggatggtact 
tgtggataat 
catcactgag 
ctatgcaggg 
ccatgcttgt 
tggagtagca 
tattgcagat 
ctgccaagtg 
tttcccaggg 
gtatcctcag 
aaaaatgctg 
tcgaatcgcc 
catccaacca 
ctgcccctgt 
cgagtggccc 
tcagtttttt 
atttgtcttt 
ttgaaatgag 
gaagtttcag 
ctgtctcacg 
cacagtgctt 
agatggaaac 
taacttggac 
accaagcaat 
ctaaattaga 
gcattcctaa 



agtcagcctg 
gtggttgtta 
actgagcctt 
acacagcacc 
acgcttccag 
atgctcacgc 
caaaaattgt 
gagttggagg 
gaagactgtt 
gccttggcac 
cgaactccag 
ggtatccgtt 
aagcttcgag 
attggttatt 
gagtggaatc 
gatcggtgcc 
agggtgatcc 
ttaaggcagg 
aagaatcttc 
caaattacca 
tcaccagcca 
actcttcttc 
gtgaaatcct 
ccttcagttg 
ttaaatgtat 
tcatatttta 
cattgataac 
gctcatctgt 
tatgttttgg 
tgactattaa 
tttccaaatt 
agagagacca 
tcggatggtg 
ggaatcagaa 
gagagaaggg 
atgattgtga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
-1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
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ggggttgtct tataaatttg gt 2182 

<210> 74 

<211> 1288 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3243884CB1 

<400> 74 

cgtttctttc ggagcggcgg tgaaggtcct gggtgaggta ggggtggatg gtgcttgccg 60 
cgtatcatgg ctgcctccgg aaaactcagc acttgccgtc tccctccgtt gcccacgatt 120 
cgagaaatca ttaagttgtt aagactgcaa gcagcgaagc agctatcaca gaatttcctc 180 
ctggacttga ggctgacaga taagattgta aggaaagctg gcaatctgac aaatgcttat 240 
gtttacgaag tgggccctgg gccaggggga atcacaagat ctattcttaa tgccgacgtc 3 00 
gctgaacttc tggtggttga aaaggacact cgatttattc ctggattaca gatgctttct 3 60 
gatgcagcac ctgggaaact gagaattgtt catggagatg tcttgacatt taaggtagaa 420 
aaggcttttt cagaaagtct taaaagaccc tgggaagatg atcctccaaa tgtacatatt 480 
attggaaatc tgccttttag tgtttcaact ccactgatta tcaagtggct tgaaaatatt 540 
tcctgtagag atggaccttt tgtttatggc agaactcaga tgactttgac ttttcaaaag 600 
gaagtggcag agagacttgc agccaataca ggaagcaaac agcgtagtcg cctctctgtt 660 
atggctcagt acctctgcaa tgttcgacac atctttacaa ttccaggaca agcttttgtc 720 
cccaaaccag aggtggacgt gggcgtggtg cacttcactc ccttgataca gcccaagata 780 
gagcagccat tcaagctggt ggaaaaagtg gttcagaatg tatttcagtt ccgaaggaaa 840 
tactgccatc gagggctcag aatgttattc cctgaagcgc agcgcttgga aagcacgggc 900 
aggctgttag agttggcaga catagaccct actcttcggc cccgccagct ctccatctca 960 
cactttaaga gcctctgtga tgtatacaga aaaatgtgtg atgaagaccc acaactcttt 1020 
gcatataatt tcagagaaga actcaagcga agaaaaagca aaaatgaaga aaaagaagag 1080 
gatgacgcag agaattacag actctagctg ctgcctgggg gcgagcagcc taccagatgt 1140 
cgatttgcac tacgtggagc ttcttatata ggtactcttt tgtctttaca gaatgacgat 1200 
acaaatgcca atgaccagat gtgacttatt ttccttttac tatacagctt ggcagagaaa 1260 
ataaatatca tcaaataaga aaaaaaaa 1288 

<210> 75 

<211> 1130 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3400578CB1 

<400> 75 

ggtgaacccg cagccggact tctgctgcac tggggctccg aatgacccag aatctggggt 60 
gggcttaggc cccgggagtg agatggcctc aatcttgcga acgcctcagg ctctccagct 120 
cactctagcc ctgatcaagc ctgacgcagt cgcccatcca ctgattctgg aggctgttca 180 
tcagcagatt ctaagcaaca agttcctgat. tgtacgaatg. agagaactac- tgtggagaaa 240" 
ggaagattgc "cagaggtttt accgagagca tgaagccggg ccaatccgag cctacatcct 300 _ 
tgcccacaag gatgccatcc agctctggag gacgctcatg ggacc caeca gagtgttccg 3 60 
agcacgccat gtggccccag attctatccg tgggagtttc ggcctcactg acacccgcaa 420 
caccacccat ggttcggact ctgtggtttc agccagcaga gagattgcag ccttcttccc 480 
tgacttcagt gaacagcgct ggtatgagga ggaagagccc cagttgcgct gtggccctgt 540 
gtgctatagc ccagagggag gtgtccacta tgtagctgga acaggaggcc taggaccagc 600 
ctgatgcagg tctatgaaga ccagtggtag tgcccagact tctcctagac atctagtcta 660 
aaacattctc ctaggaccag ggaagcctgg cttacagtgc catttctgct gggcaccacc 720 
acctgcctga gggcctagct caccacagca catcctccag gatctagcct tctatctacc 780 
tcttctctgg aatgtttatg gtggttcaga agaatgatga ctcctctttg ctgagaactg 840 
ttcatccttc ttcaagaaga agcttgccag gccgggcacg gtgctcacgc ctataatccc 900 
agcactttgg gaggccgagg cgggcggatc acaaggtcag gaattcgaga ccagcctgac 960 
caacatggtg aaaccccatc tctactaaaa atacaacaat tagccaggca tggtggtgca 1020 
tgcctgtaat cccagctact cagaggctga ggcaggagaa ttgcttgatc ctgggaggca 1080 
gagggtgcag tgagccgaga tcgtgccatt gcactccagc ctgagggagg 1130 

<210> 76 
<211> 1815 
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<212> DNA 

<213> Homo sapiens 

<220> 

<221> raisc_feature 

<223> Incyte ID No: 3422577CB1 



<400> 76 

gacggcactg 

gccgctcaga 

cccgcgccga 

ttttcccagg 

ggacaaggag 

ccgggcagaa 

ggccctttcc 

ggccatgatt 

cgtgatgtgc 

ccactcatat 

ctatttcttc 

gcctttgcgg 

attctgcatg 

tggctggacc 

cctatttgaa 

catggcctat 

gaagacctgg 

gtcctatgtg 

caacagcttc 

tcctggggtg 

tcacagcatc 

aagtggattc 

aggcatcatg 

cgccagtttt 

gccagatcag 

gctgacatcc 

actgagcagg 

cttcaactca 

aaatctgtat 

gttggaaaga 

aatctggaag 



ggtggggccg 
gccgcgcgcc 
gctgcctgct 
atgacagagc 
tcagagtcag 
tccgcacccc 
aacttgtctt 
gcatttttct 
gttcagatta 
gatctgccct 
tatggtgaga 
attctcagta 
tttgtactga 
catgtgacat 
ggaatgatct 
atgtttggct 
gaaggcttca 
atgtccgggt 
actgtggact 
atccagtcag 
gctctctcca 
aaacgagcct 
gatcgctttg 
atcagaggcc 
cagctccaca 
accacagagg 
ggcaggtctc 
ctgtcttttt 
atacagtcta 
agtcacgggt 
acggt 



agctccaggg 
tcccgggcgg 
ccggcggctt 
tgaggcagag 
aagcaaaggt 
tgccagtctc 
caagatggaa 
tcatcatcat 
agtgtttcca 
ggttcaggac 
cagtgacgga 
aataccaccg 
gtctggtcaa 
tgctgattgt 
ggttcattgt 
ttttctttgg 
ttgggggctt 
acagatgctt 
gtgagccctc 
tcattggctg 
cctttgcctc 
ttaaaatcaa 
actgccagta 
ctaacccaag 
tcttcaacac 
acgagtaggg 
caaggcaagc 
tttttttttt 
tgtgtttaga 
tgtaaaacca 



ctggctgctg 
ggcggggccg 
cgctgctagc 
ggtggcccat 
agatggagag 
tgcagatgat 
gaactggtgg 
ttacctggga 
tgagataatc 
get cage tgg 
ttacttcttc 
gttcatttcc 
gaagcattat 
tgtaacacag 
ccccatatct 
tcggacccca 
etttgetact 
tgtctgccct 
ggacctgttt 
gaaaacggtc 
gctcattggc 
agactttgee 
tetgatggee 
caaactgatt 
getgeggtet 
gccacccagg 
ccagctggtg 
tggagggtat 
atttgtgttg 
tttggatttt 



ggctgctaag 
gccgtgggag 
tcgcggcgac 
gagccggttg 
actgeategg 
accccggagg 
gtgagaggca 
ccaatggttt 
actattggct 
tactttctcc 
accctggtcc 
tttactctct 
egactgeagt 
tcacatcttg 
tgtgtgatct 
ctcatcaagc 
gtggtgtttg 
gtggagtaca 
cgcctgcagg 
eggatgtace 
ccctttggag 
aataccattc 
acctttgtca 
cagcagttcc 
catctgatcg 
gecaggagaa 
tgacttagac 
tttttatttg 
taagtaaact 
tttaaaacaa 



ggaactgtga 
tccgcgcgtg 
gtegggcega 
cgccacccga 
acagtgagag 
tcctcaatag 
. tcctgacttt 
tgatgataat 
acaaegtcta 
tgtgtgtaaa 
agagagaaga 
atctaatagg 
tctacatgtt 
t tatccacaa 
gtaatgacat 
tgtccccgaa 
gccttctgct 
acaatgacac 
agtacaacat 
ccttccagat 
gattcttege 
ctggccatgg 
atgtatacat 
tgactttacg 
acaaagggat 
caggaacaga 
aatgacgagg 
tgggttcaaa 
acagctttga 
aagtattaat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1815 



<210> 77 
<211> 1740 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 



3706809CB1 



<400> 77 

tgacgcagcc 

gcgaccgttg 

getgeggegg 

cgctcctggg 

gaaacaaccg 

tegctacgat 

gtccaggcca 

agacgctctc 

aaaggagttg 

tatcaaggat 

tgaccatgtt 

gatttgtgac 

ggcaggctgc 

gctccgggcg 

aaccgtgccc 

ttatgcccag 

agtgatgaag 



egggtctcag 
ctgcaggtgg 
agcccagtga 
aagcagcccc 
ctcgaggagt 
aaagcttatc 
tttcgggaaa 
aaggctggag 
ccagtcgata 
tggtccgacc 
aagatgacat 
aatctccgtg 
agcaaagtgt 
ggtatgggcg 
aattacctgc 
gctgagatgt 
tttcacaagt 



ggaacatggc 
tccaggcttg 
aagtggtgtt 
gcaaggcacc 
ccgcatcccg 
ccggggacag 
aacaagggaa 
ctgtgccaaa 
agctgaaagg 
tcgtaacgcc 
atccaaagac 
accctgggaa 
tactcaccaa 
cacatttccg 
cccctgacac 
ctaataaagc 
atgaggaaga 



ggcgctggtg 
ggaccttgac 
tccttccgga 
atetgaggee 
cgctcccagc 
gaggctgagc 
gatcctgetg 
aatgttcttc 
tgtcagcctc 
acaaggaata 
tcagcttcag 
cctggggaca 
aggctgtgtg 
gatgeccatt 
tegggtctat 
tagtgaccat 
ggaagatgta 



agacccgcga 
gegaggeget 
gaggtggtgg 
agtgcccagg 
acctgggaag 
agtgtaatga 
gaaggtcgea 
tttagccgtc 
attaaggtga 
atggggattt 
cattcactgc 
attctgagat 
gatgcctggg 
atcaataatc 
gtggctgaca 
ggctgggtgt 
gaaaceggag 



ggtttgtcgt 
gggtceggge 
aacagaagcg 
agcaacgaga 
agtctgggct 
caatagtaaa 
ggctcatttc 
tagaatacct 
aatttgagga 
ttgccaagcc 
ctttattatt 
ctgcagctgg 
ageccaaagt 
tggaatggga 
actgtggcct 
gtgatcaacg 
ccagtcaaga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 
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cgactcggac tggacagagg cgccggcagc 1080 
gagcctggag tccctgcagc tggccgagag 1140 
tgtgcctggt gtggacagcc tcaactcggc 1200 
gaaaagacag ctgcggggga gggcggagga 1260 
acgcagaagt gacttctgct tgaggacgtc 1320 
ggaggctggc ggagtcagtg actatggccc 1380 
atacagttac aggaaaaata agaacttcct 1440 
gcgtcactga ttttgaggtt cttttttctc 1500 
gtgtgttttt aaaaattgtc caccaagaag 1560 
catcctggca gggaggcgcc tgctccacca 1620 
agctataggt cacagccaca tcactcacag 1680 
gagcagcaga ttacgttaaa aaaaaaaaaa 1740 



cagactgtgg gtaccgggtc cgagggactc 60 
gaaagccacg aacgtggtgc ggctgctact 120 
cggctccggg acggtcgccg cgtccaagtc 180 
cgagaccccg ctgctgctgg aggcaagtga 240 
ttggcagttt ttggaaactg tgcaagaatt 300 
ttcttattac aacttaatcc tgaagaaagc 360 
ccttttaaag tttgctttct ctataagggc 420 
gattgcagct gatgagccac caccagatgg 480 
gcacacctgt aaaattaatg agattaaaaa 540 
accttatcta tttaaaggag atcacaaatt 600 
gattctctat gccgaaatgg gtactagaac 660 
aaaagctcaa aatgaggaaa ttctgtatgt 720 
acggaaaatg tacttatctg ggtatggtgt 780 
agcactggat gatacccaag ttaaaactgt 840 
aacaaatgaa gttcaaggat ttctctttgg 900 
agataatctg acagcattcc aaaaatacct 960 
gaaagtctgg gaactacaag atcttagttt 1020 
agtttatgat gccattaaat taatgaaaga 1080 
agtccaaatg attggtaatg tcttaattgg 1140 
agaggatgtc tcgttgaact gcttccatga 1200 

1225 



ttggctgcct catgttgagg ttcagagtta 
tgtggtgatt ggcggggaga cctacggcgt 
cactggtggc aagaggctgc tgatccccgt 
catggcggca agcatcctgc ttttcgaagg 
cttgagcagg gacaggagtt accactgagg 
tgcagctcct cctacaccag cacactggtg 
ccacgttcag gaggaaggtg tgatgccgtc 
cagaaagaac aggtccgaat tcttcctgtc 
ttggtgacaa taggtgaccc acgtggctct 
cactttgtgc ccagaaagtt cctgaagcat 
gctggtgggt gtttgtaatc gccaagcacc 
ctgatcactg gttggtggaa aataaactat 

<210> 78 

<211> 1225 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3745914CB1 

<400> 78 

cgggaagagc cggccgaagc gtggcggcca 
gcgcttttct ctccgtgcca tggcgccagc 
aggctccaca gcgctgtggc tttcgcagct 
ggtgactgcc cacttggccg cgaagtggcc 
atttatggca gaagaaagta atgaaaaatt 
agcaatttat aagcaaacag aatcagatta 
tggacagttt ctagacaatt tacacatcaa 
atactcccca gctattcaga tgtttcagca 
ttgtaatgca tttgtggtta ttcataagaa 
gctgctgaag aaagctgctt caaggactag 
tcctacaaac aaagagaact taccagtggt 
atttagtgca tttcacaaag tattgtctga 
tcttcgccat tatattcaga aaccaagctc 
ggagctagca attaagagta cagaatacaa 
gactaatact actgtagagg atgagactga 
gaaactaaaa gaaatatatt cagatcttag 
gattgagagt aacaaacaaa tgatgccttt 
tcaagcagct tctcaaataa tgtccgctcc 
catttcacag aacttcccca taaaagccag 
atgaatattg tgtggagtac ttttttgcca 
atactgatgt tacattaaac atata 

<210> 79 
<211> 800 
<212> DNA 

<213> Homo sapiens^ . - 

<220> 

<221> misc_feature 

<223> Incyte ID No: 4000776CB1 

<400> 79 

atttacagct tgtcatcgtg tcatcccttt 
tgtcaggttt aattgacatt ttaaaatgtc 
gaaacgaaaa gttattgtgt atgcagaaaa 
tgatattagt gaagcaaata ttcgtcgctg 
caaagcaaca acaaaatgtt ttacgggacc 
agctgtacta cgtttcgtca gtgagacacg 
aatgcaattg aaggcaggag aagttgccaa 
agccacaaga ggctggtgtg accgattcat 
aacatcgttt tgtcccaagc tccccactgc 
tttcaagaag tgctgcataa ccagtactct 
aaatgcagac atcaatgact gtggtttgaa 
tgaagttata attataactt aaccaattta 



aaccaacatg tgttttgttg gtatcaaatg 60 
ttcaaaaaga tcacactatg attcagcatt 120 
gcatggaaac agagcagcag ggcgtacatt 180 
gagaaatgat cgcaattcca tattttcctg 240 
taagaaaggt aggtacccac aagtagatga 300 
tgcaaaagga ttgcctatca cacgccaagc 360 
aaccctagga attgatgaaa caaaattcaa 420 
gcgtcgagca ggactatcat taaggcatca 480 
cattaaacag aagacagtgt tggagcactc 540 
tgacaacacc gggagagacg ttctgtggaa 600 
aagtgattca gaagagttgg attcagaata 660 
tttcacacat tttctttaca tatgcacaag 720 
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attgatgtga taaaaatctg tttaactcaa agcgctgttt caataaatat aaacattttc 780 
tgtgatatga aaaaaaaaaa 800 

<210> 80 

<211> 1163 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 40713 04CB1 

<400> 80 

atcaaaattt actctgcgag actctgctga actggctgca tcagggagaa atttcatctc 60 
ccagaagggt gttgctcatc gtttcttccc ggaaacatct gcagagacta gcttttcagg 120 
ctaaggtatc ctccatgatg ttaccattgc aaggtgccca gatgctgcag atgctggaga 180 
aatccttgag gaagagcctc ccagcatcct taaaggttta tggaactgtc tttcacataa 240 
accatggaaa tccattcaat ctgaaggctg tggtggacaa gtggcctgat tttaatacag 300 
tggttgtctg ccctcaggag caggatatga cagatgacct tgatcactat accaatactt 360 
accaaatcta ctccaaagat ccccaaaact gtcaggaatt ccttggatca ccagaactca 420 
tcaactggaa acagcattta cagattcaaa gttcacagcc tagcctgaat gaggctatac 480 
aaaatcttgc agccattaag tccttcaaag tcaaacaaac acaacgcatt etctatatgg 540 
cagctgaaac agccaaggaa ctgactcctt tcctgctgaa atcaaagatt ttatctccca 600 
gtggtggcaa acccaaggcc atcaaccaag agatgtttaa actctcatcc atggatgtta 660 
cccatgctca cttggtgaat aaattctggc attttggtgg taatgagagg agccagagat 720 
tcattgagcg ctgcattcag acctttccca cctgctgtct cctggggcct gaggggaccc 780 
ctgtgtgctg ggatctaatg gaccagactg gagagatgag aatggcaggc accttcgcgg 840 
aataccggct ccatggcctt gtgacgtatg tcatctattc ccacgcccag aaattgggca 900 
aacttgggtt tcctgtctat tctcatgtag actacagcaa tgaagctatg caaaaaatga 960 
gttacacact gcaacatgtt cccattccca gaagctggaa ccagtggaac tgtgtacctc 1020 
tgtgatgcca atcctgaaca taagacagtg ttgggcaggt ctgggcgtat aatttgagga 1080 
gtggatggtg gatgggaaga attaattaag cggtcgcaag cttattccct ttagtgaggg 1140 
ttaattttag cttgcaggct acg 1163 

<210> 81 

<211> 888 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 4344970CB1 

<400> 81 ■ 
aaggactatg gcaggggaga acttcgctac gccgttccac gggcacgtgg gccgcggcgc 60 
cttcagcgac gtgtacgagc ccgcggagga cacgtttctg cttttggacg cgctcgaggc 120 
agcggctgcc gaactggcag gagtggaaat atgcctggaa gtagggtcag ggtctggtgt 180 
agtatctgca ttcctagcct ctatgatagg ccctcaggct ttgtacatgt gcactgatat 240 
_caaccctgag gcagcagctt gtaccctaga gacagcacgc- tgtaacaaag ttcacattca- 300 
accagttatt acagatttgg tcaaaggctt gctaccaaga ttgaccgaaa aagttgatct 360 _ 
tctggtgttt aatcccccct atgtagtgac tccacctcaa gaggtaggaa gtcacggaat 420 
agaggcagct tgggctggtg gcagaaatgg tcgggaagtc atggacaggt tttttcccct 480 
ggttccagat ctcctttcac caagaggatt attctattta gttaccatta aagaaaacaa 540 
cccagaagaa attttgaaaa taatgaagac aaaaggtctg caaggaacca ctgcactttc 600 
cagacaagca ggccaagaaa ctctttcagt cctcaagttc accaagtctt agcatacagt 660 
gtgtgcccag aactactgga aactgaatgc atttagcata ttttgaaact gaagtcattc 720 
attaggtaac aaggaatttt atcagaaatt tgtcattaaa aaaaggtaga aaggtggaac 780 
attccctttc agtcaggcct atagaattat ttgcacaagt acaagtaaat atgtgtatta 840 
tatttatagt ttgcatttta aatatagaaa ttatgtatga aaaaaaaa 888 

<210> 82 

<211> 1163 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 



53/55 



WO 01/32888 



PCT/USOO/30485 



<223> Incyte ID No: 5392302CB1 



<400> 82 

gtggcgtctc 

agcacctttg 

cggcggtggt 

caaaaactgg 

ccaaccagtg 

gcaaagattc 

aatttacata 

aaggccagaa 

tgagggcagc 

ctccagcaac 

tcaatccagc 

tctgctgcaa 

atgctgggga 

taggggctga 

cagagaaagt 

ccttctacct 

tcattgcagc 

accttccgct 

aatgtacttg 

tctttgeaaa 



tggttacggg 
agcgatggcg 
agtggtgggc 
agaaaccatc 
tgtccaagct 
ttttggcaat 
tcagactaaa 
tatcattgtc 
agccaatgtc 
ttctttggaa 
ccctgccatt 
tgaaagtgag 
ggctgcatta 
aggatgtgtg 
caaggctgtg 
ggcttactat 
agtcagtgtc 
tactctgttt 
ggggtggctg 
tattagtatc 



gtcgggcaaa 
gcgtctgggg 
tcctgcatga 
catggacata 
gctcggcttg 
gattatatag 
gatgctgcta 
atagtggctg 
attagcagag 
gccctaacaa 
gctgacctgg 
gctgagattt 
gtgctcttga 
gtgctgtcac 
gataccacgg 
ccaaatctgt 
caggctgcag 
tgattgctat 
ctcctggcta 

ggg 



gggcagaggt 
aaccccagag 
ccgacctggt 
agttttttat 
gagcaatgac 
aaaacttaaa 
caggaactgc 
gagcaaattt 
ccaaagtcat 
tggcccgcag 
atccccagtt 
taactggcct 
aaaggggctg 
agacagaacc 
gtgctggtga 
cc ttggaaga 
gaacacagtc 
tagtcccaaa 
atgcttatta 



caccgtccag 
gcagtggcaa 
cagtcttact 
tggctttgga 
gtccatggtg 
acagaatgat 
ttctataatt 
acttttgaat 
ggtctgccag 
gagtggagtg 
ctacaccctc 
cacggtgggc 
ccaggtggta 
tgagccaaag 
cagctttgtg 
catgctcaac 
atcttaccct 
ataaatatac 
gaaaatgtcc 



gttggacagc 
gaggaggtgg 
tctcgtttgc 
gggaaaggtg 
tgtaaggttg 
atttctacag 
gtcaataatg 
acggaggatc 
ctcgaaataa 
aaaaccttgt 
tcagatgtgt 
agcgctgcag 
atcattacct 
cacattccca 
ggagctctgg 
agatccaatt 
tacaaaaaag 
ctgggaataa 
tcgtcccgta 



60 

120 

180 

240 ' 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1163 



<210> 83 
<211> 931 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 5555235CB1 



<400> 83 

gcgaggacag 

ttccccgccc 

gaggatcccc 

cttctcccac 

acctggaccc 

tcccgctctg 

gactggcggc 

gcgaggcggc 

cccaaaccac 

aggaagaaat 

ggagttgaga 

cagatgtcat 

gtgttaaggg 

tttagggaca 

agtgcctttc 

taaa tgggga 



aggtgggctc 
cttccgtgtg 
aaccgcctgc 
ctgttcgact 
aacgggtgaa 
gctgtgctgc 
aatgccagcc 
aagaggagct 
ctatccccag 
cctaaatgct 
gccatgggtt 
acattctcag 
atgctccccc 
ttttatttta 
tctttcatca 
aactttttat 



aacaccaaga 
cacgctcccc 
gaaacccaag 
ccccatcccc 
gaaagacgat 
ctcccacttt 
acaggtgcag 
gcagaggagg 
tagatggccc 
ggtgtgggag 
tggacatgac 
ctgtcttgtt 
accccacatc 
aattaaattt 
cttgttctgg 
tcaaaaaaaa 



ggccaccaat 
acgaggggct 
ccgccgcgta 
aggatgtcaa 
gaggaggagg 
gcagtgcagg 
gcgttcaagg 
caagaacaag 
tgccaagacc 
gtctaaaaca 
tggcacaaac 
ccaccagtat 
ttaagtcagt 
acaatctaat 
atgtttcaat 



gaaaacgtga 
ggttcctgaa 
ggagcgtgcg 
cctcagtccc 
acccgctgga 
agtgcatggc 
attgcatgag 
ccggtgccca 
agcacccagc 
tggggagagt 
agctgtcata 
ttaccaggaa 
gtgccaagta 
ggtaaattga 
cagtttagtg 



ctttttcaca 60 
aggatgtaca 120 
ttcgggccct 180 
tcaaggccat 240 
ccagctgatc 3 00 
ccagcaccag 360 
tgaacagcag 420 
ccactgagac 480 
aagattatag 540 
ttttggatct 600 
tgttcatggt 660 
aacaaagaat 720 
ctgagatgat 780 
ttacctaatt 840 
gaaggaaaaa- 9 0 0 
931 



<210> 84 

<211> 1794 

<212> DNA , 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



5573296CB1 



<400> 84 

agctcttata 

ctaagtccat 

ttccttctgg 

atcatggacc 

gtctttattg 

cccattaaca 



aactagtggc 
gtgccaaagg 
aaatctttga 
tcgcgggact 
cc tcagggct 
agcagctctt 



aatttctgaa 
ctgccaggaa 
ctgtgggtag 
gctgaagtct 
aatcatcaac 
ccggaagatc 



cccagccggc 
ggagacgcct 
ttatttattt 
cagttcctgt 
accattcagc 
aactgcagac 



tccatctcag 
tcctgagtcc 
ctgaataaga 
gccacctggt 
tcttcactct 
tgtcctattg 



cttctggttt 60 
tggatctttc 120 
gcgtccacgc 180 
cttctgctac 240 
cctcctctgg 300 
catctcaagc 3 60 
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cagctggtga tgctgctgga gtggtggtcg 
cgcgcctacc tcaagtatgg gaaggaaaat 
attgactttc tgtgtggctg gagcctgtcc 
gtcctggcca agaaagagct ggcctatgtc 
gagatggtct tctgttcgcg caagtgggag 
cagcacctcc gggactaccc cgagaagtat 
ttcacggaga agaagcatga gatcagcatg 
ctcaagcatc acctgttgcc acgaaccaag 
aatgtagttt cagctgtata tgactgtaca 
ctgctgggag tcctaaacgg aaagaaatac 
ctggaagaca tccctgaaga cgatgacgag 
gagaaggatg cctttcagga ggagtactac 
gtgccccccc ggcggccctg gaccctcgtg 
taccctttct tccagttcct ggtcagcatg 
agcttcatcc tcgtcttctt tgtggcctct 
gaaattgaca agggctctgc ctacggcaac 
ctcagggagg tgtcaccatc cgaagggaac 
ccttagtggg acacggtgac aaaggctggg 
gacctctcca gccagggagt ctggtctcaa 
ctttttttcc ccatgtgctt tagtgggctt 
aatggctgtg tggtgagtgt gaactttgtt 
caggggaggg cagggctggg gaccgaaggg 
agttttctgt aaccettggc tgccagagat 
aaattatgcc tccaagaaaa aaaaattaaa 



ggcacggaat gcaccatctt cacggacccg 420 
gccatcgtgg ttctcaacca caagtttgaa 480 
gaacgctttg ggctgttagg gggctccaag 540 
ccaattatcg gctggatgtg gtacttcacc 600 
caggatcgca agacggttgc caccagtttg 660 
tttttcctga ttcactgtga gggcacacgg 720 
caggtggccc gggccaaggg gctgcctcgc 780 
ggcttcgcca tcaccgtgag gagcttgaga 840 
ctcaatttca gaaataatga aaatccaaca 900 
catgcagatt tgtatgttag gaggatccca 960 
tgctcggcct ggctgcacaa gctctaccag 1020 
aggacgggca ccttcccaga gacgcccatg 1080 
aactggctgt tttgggcctc gctggtgctc 1140 
atcaggagcg ggtcttccct gacgctggcc 1200 
gtgggagttc gatggatgat tggtgtgacg 1260 
tctgacagca agcagaaact gaatgactga 1320 
cttggggaac tggtggcctc tgcatatcct 1380 
tgagcccctg ctgggcacgg cggaagtcac 1440 
ggccggatgg ggaggaagat gttttgtaat 1500 
tggttttctt tttgtgcgag tgtgtgtgag 1560 
ctgtgatcat agaaagggta ttttaggctg 1620 
gacaagttcc cctttcatcc tttggtgctg 1680 
aaagtgaaaa gtgctttagg tgagatgact 1740 
gtgcttttct gggtcaaaaa aaaa 1794 
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Boxl Obs rvationswh r certain claims wer found unsearchabl (Continuation of it m 1 of first sh et) 


-- 


This International Search Report has not been established in respect of certain claims under Article 17(2){a) for the following reasons: 

1. fxl Claims Nos.: 

— because they relate to subject matter not required to be searched by this Authority, namely: 

Although claim 18 are directed to a method of treatment of the human/animal 
body, the search has been carried out and based on the alleged effects of the 
compound/composi ti on . 

2. Pn Claims Nos.: 20,21,23,24 

— because they relate to parts of the International Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 

see PUKIrltK 1 IMrUKrlM 1 IUN sneet rLi/ioH/ciu 

3. Claims Nos.: 

■ — because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 


Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 


this International Searching Authority found multiple inventions in this international application, as follows: 

see additional sheet 

1. 1 "1 As all required additional search fees were timely paid by the applicant, this International Search Report covers all 
1 — ' searchable claims. 

2. f~J As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 

of any additional fee. 

3. L 1 As only some of the required additional search fees were timely paid by the applicant, this lnterriational Search Report 
* L_J covers only those claims for which fees were paid, specifically claims Nos.: 

4. [77] No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
L * J restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 

in part 1-19, 22, 25-28 

Remark on Protest The additional search fees were accompanied by the applicant's protest. 

f~| No protest accompanied the payment of additional search fees. 
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This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 



Invention 1; Claims: in part 1-19,22,25-28; all as far as 
applicable 

Polypeptide being a human transferase molecule and relating 
to SEQ ID NO 1, polynucleotide coding for said transferase 
and relating to SEQ ID NO 43; cell transformed with said 
polynucleotide; transgenic organism comprising said 
polynucleotide; method for producing said polypeptide; 
antibody binding to said polypeptide; methods for detecting 
a target nucleotide in a sample; pharmaceutical composition 
comprising said polypeptide; methods for screening for an 
agonist/antagonist of said polypeptide; method for screening 
for a compound binding to said polypeptide; method for 
screening for a compound that modulates the activity of said 
polypeptide; method for screening for a compound that alters 
the expression of a target nucleotide; method for assessing 
toxicity of a test compound 

Inventions 2-42; Claims: in part 1-19,22,25-28; all as far 
as applicable 

As invention 1 but limited to subject-matter relating to SEQ 
ID NOs 2-42 and 44-84; 

wherein invention 2 is limited to SEQ ID NOs 2 and 44; 
wherein invention 3 is limited to SEQ ID NOs 3 and 45; etc. 

and invention 42 is limited to SEQ ID NOs 42 and 84. 
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Continuation of Box 1.2 
Claims Nos.: 20,21,23,24 



Claims 20 and 23 refer to an agonist/antagonist of the polypeptide of 
claim 1 without giving a true technical characterization. Moreover, no 
such compounds are defined in the application. In consequence, the scope 
of said claims is ambiguous and vague, and their subject-matter is not 
sufficiently disclosed and supported (Art. 5 and 6 PCT). 
No search can be carried out for such purely speculative claims whose 
wording is, in fact, a mere recitation of the result to be achieved. 
The above comment also applies to claims 21 and 24. 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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